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<110> QmniGene BioProducts, Inc. et al. 



<120> MICROORGANISMS MID PROCESSES FOR ENHANCED PRODUCTION OF 
PANTOTHENATE 



<130> 



BGI-154PC 



<160> 31 

<17d> Pa tent In Ver- 2.0 

<210> 1 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: promoter 

sequence . ^ ■ 

<220> 

<221> -35_signal 
<222> (136) . . (141) 

<220> 
. <221> -lO^signal 
•<222> (159) . . (164) 

"<400> 1 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 

tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120 

ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180 
gcaaccccgc ctgt 194 

<2io> 2 
<211> 163 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: promoter 
sequence 

<220> 

<221> -35_signal 
<2^2> (113)..(118) 



<220> 

<221> -10_signal 
<222> (136).. (141) 



.<4:00> 2 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 
tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 
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•i 



atctacaagg tgtggtataa teiatcttaac. aacagcagga cgc . 



163. 



<210:i 
<211> 
<212> 
<213> 




.1 



<220> 

:<223> 



Description of Artificial Sequence : prompter 
sequence ^ . . 



■<220> - 

<221>. -35_signal ' . 

;<222> (34) . . (39) : . 

- <220> ■ . . • ■ ' - ■ 

<221> -10_signal 
<222> (58)- -(€3) 

:<220> _ . ; , 

<221> -35_signal ■'•'.[ 
<222> (75).. (80) . 

<22p> • ; ■ ■ * • 

<22i> -l6_signal 

<222> (98) • • (103) . I 
<400> 3: 

gaggaatcat agaattttgt eaaaataatt ttattgacaa cgtcttatta acgttgatat 60 
aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120 
tgcaatg . • . .. . ; 127 



<210> 4 
<211> 24 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence :ribiosome 



binding site 



<400> 4: 

taaacatgag gaggagaaaa catg 



24 



!<213> Artificial Sequence 



<210> 5 
:;<211> 28 
<212> DNA 




:<220> 

<223> bescriptipn.iof; Artificial Sequence iribosome 



binding site 




<210> € 




28 
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<211> 13 
<212> DNA . ' 
<2i3> .'Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : ribosome 
binding site ■ : 



<400> 6 

agaaaggagg tga . • 

<21Q> 7 ' ''■ 

<211> 23 • ■ 
<212> DNA 

<213> Artificial 'Sequence 
<220> 

<223> Description pf Artificial Sequence: ribosome 
binding site' 

<220> -./^ :'=;.: 

<221> ioisc^feature 
<222> 17/ 18, 19^ 20 ; 
<223> n « a, t, c, ^or .g 

<400> 7 

ttaagaaagg aggtigannnn atg 
<210> 8 . 

<211> :23 
<212> DNA 

<213> Artificial Sequence ; 

<220> • 
<223> Description of Artificial Sequence : ribosoine 
binding site 

<220> 

<221> misc_feature 
<222> 16, 17, 18,: 19, 20 
<223> n - a, Cr t, . or :g 

<400> 8 

ttagaaagga g^ganhwn :atgr 

<210> 9 ■ : : : « .J -i 

<211> 23 . 

<2i2> .pNA . ■ ^ : ': • . . ; 

<213> Airtificial :s4quence . M 

<220> 

<223> Description of Artificial Sequence : ribosome 
binding site , . 

<220> . ... . i I V ; • : . .. . 

<221> misc_feature 

<222> l^i, 15, :16;: 17, ;;18,- 19, 20 ; . ; : 
<^223> .11; » a,= c,. t> jor '-g : 



13 



23 



23 
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<406> 9 • ... 
agaaaggagg tgannnnnrm atg 

<210> 10 ;r ' 

<211> 22 ^ ' ' ' / 

<212> DNA 

<213> Artificial . Sequence 
<220> 

<223> Description of Artificial Sequence :xibosome 
binding site 

<220> 

<221> misc_fe4ture 

<222> 14, 15, 16, 17, 18, 19 

<223> n = a, (i, t, or g 

<400> 10 I 
agaaaggagg tgannnnnna tg 

<210> 11 . 
<211> 25 . . 

<212> DNA 

<213> Artificial Sequence 

.- .'. . . * 

<220> 

<223> Description of Artificial Sequence iribosoiae 
\ binding . site 

<400> 11 

ccctcta:gaa ggaggagaaa acatigr 
<210> 12 ■ ■ ' 

<211> 24 ... 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding i site 

<400> 12 : . 

ccctctagag gaggagaaaa catg 

<210> 13 
<2ll> 23 

<212> DNA i \ ; • 

<213> Artificial Sequence 

<220> 

<22;?> Description of Artificial Sequence : ribosome 
binding site 

<4bo> 13 ' ■ • •[ : ' : ^ 

ttagaaagga ^gatttaaat atg 

<2io> 14 • ; I ; ■ . I '. • 
<2ii> 23 ; ; r . .. 
<2ii>:.DNA • . 
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<213> Artificial Sequence 

. •<22ti> ■ . : • . ■ :• ' ■ 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 14 

ttagaaagga ggtttaatta atg 

<210> 15 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 15 

ttagaaagga ggtgatttaa atg 

<210> 16 
<2li> 23' 
<212> DNA 

<213> Artificial Sequence 
<22b> 

<223> Description ;of Artificial Sequence : ribosome 
binding site 

<400> 16 

ttagaaagga ggtgtttaaa atg 

<210> 17 - 
<211> 28 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<400> 17 

attcgagaaa ggaggtgaat ataatat^ : ' 

<2io> 18 ; : 

<211> 27 . ' . 

<212> DNA ■ ' ' 

<213> Artificial Sequence 

•<22p> \ \ ; : ■ ; 

<223> Description ipf Artificial Sequence : ribosome 
binding site , ! ' ' 

<400> 18^ ; : . - ; 

attcgagaaa ggaggtgaat aataatg! 

<2i0> 19^ ; ■ " . • - • ^ 

<211> 28 - ■ i 
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<2-12> :pna. ■ : . i ; ^ 

<2i3> 'Artificial Sequenpe 
<220> i ^: . : ' - . • 

.<223> ;Deseri,ption of Artificial Sequence rribosome 
jbindirig site 

<400> jl9 

attcgtagaa aggaggtgaa ttaatatg 
<210> 20 : . ' . 

<211>*:5i 

<212> DNA - ' , 

<213> Artificial Sequence : 

<2'20> 

<223> ; Description of Artificial Sequence: 5' PGR primer 
<223> ifor serA gene 

<400> ;20' ' 

cc.ctctagag gaggagaaaa catgtttcga gtattggtct cagacaaaat g 

<210> 121 
<211> ^43 
<212> DNA 

<213> Artificial . Sequence 



.<220> 

<2 2 3> Description of Artificial Sequence : 3 ' PGR primer 
<223> -for serA gene 

<400> ;2i 

cccggatcca attatggcag atcaatgagc ttcacagaca caa 

<210> 22 - . 

<211> .48 

<212>:dna 

<213> ; Artificial Sequ^^nce 
<22b>= 

<223> Description of Aritifiicial Sequencers' PGR primer 
<223>;for glyA gene 

: i . I 

<400> :22:* . ' : . ; 

ggatctagag gaggtgtaaa qatgaaacat ttacctgcgc aagacgaa 
<2io> 23 

<211> ^43^ . 
<212> :DNA^ : . 

<213>iArtificiai; Sequence , ; \ ' 

<22o>-: - ^ ; 

<223> Description pf Artificial Sequence: 3' PGR prime r^ 
<223> ifor igiyk: gene :\. ' 

<400> ;23- ■/ . • • ; . 

cgggg4tccG ccatcaabaa :tWc4cacttt ctattgkttc tac 

<210> -24: • : ^ ' ■ :. ■ : 
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<211> 7926 . . 
<212>.DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :serA overexpression 
<223> plasmid 

<400> 24 

gaattttgcg gccgcttcga aagctgtaat ataaaaacct tcttcaacta acggggcagg 60 
ttagtgacat tagaaaaccg actgtaaaaa gtacagtcgg cattatctca tattataaaa 120 
gccagtcatt aggcctatct gacaattcct gaatagagtt cataaacaat cctgcatgat 180 
aaccatcaca aacagaa.tga tgtacctgta aagatagcgg taaatatatt gaattacctt 240 
tattaatgaa ttttcctgct gtaataatgg gtagaaggta attactatta ttattgatat 300 
ttaagttaaa cccagtaaat gaagtccatg gaataataga aagagaaaaa gcattttcag 360 
gtataggtgt. tttgggaaac aatttccccg aaccattata tttctctaca tcagaaaggt 420 
ataaatcata aaactctttg aagtcattct ttacaggagt ccaaatacca gagaatgttt 480 
tagatacacc atcaaaaatt gtataaagtg gctctaactt atcccaataa cctaactctc 540 
cgtcgctatt gtaaccagtt ctaaaagctg tatttgagtt tatcaccctt gtcactaaga 600 
aaataaatgc agggtaaaat ttatatcctt cttgttttat gtttcggtat aaaacadtaa 660 
tatcaatttc tgtggttata ctaaaagtcg tttgttggtt caaataatga ttaaatatct 720 
cttttctctt ccaattgtct aaatcaattt tattaaagtt catttgatat gcctcctaaa 78.0 
tttttatcta aagtgaattt aggaggctta cttgtctgct ttcttcatta gaatcaatcc 840 
ttttttaaaa gtcaatatta ctgtaacata aatatatatt ttaaaaatat cccactttat 900 
ccaattttcg tttgttgaac taatgggtgc tttagttgaa gaataaagac cacattaaaa 960 
aatgtggtct tttgtgtttt tttaaaggat ttgagcgtag cgaaaaatcc ttttctttct 1020 
tatcttgata ataagggtaa ctattgaatt cggtaccaag agfcttgtaga aacgcaaaaa 1080 
ggccat<pcgt caggatggcc ttctgcttaa tttgatgcct ggcagtttat ggcgggcgtc 1140 
ctgcccgcca ccctccgggc cgttgcttcg caacgttcaa atccgctccc ggcggatttg 1200 
tcctactcag gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctttcg 1260 
actgagcctt tcgttttatt tgatgcctgg cagttcccta ctctcgcatg gggagacccc 1320 
acactaccat cggcgctacg gcgtttcact tctgagttcg gcatggggtc aggtgggacc 1380 
accgcgctac tgccgccagg caaattctgt tttatcagac cgcttctgcg ttctgattta 14 40 
atctgtatca ggctgaaaat cttctctcat ccgccaaaac aggatccaat tatggcagat 1500 
caatgagctt cacagacaca atatcaggga catttgttag ttctttcaca attttatctt 1560 
ccagatgtct gtcaaaggaa agcatcatga tggcttctcc gcctttttcc ttacggccaa 1620 
.cctgcatagt tgcaatgtta atatcattat ctccgagaat acgtcctact cggccgatga 1680 
cacctgttgt atcttgatgc tggatataca ccaagtgacc agtcggataa aaatcaatat 17 40 
taaatccatt gatctcgaca attcgttctc cgaaatgagg aatatacgta gccgttacag 1800 
taaaggtgct gcggtctcct gtcactttta cgctgatgca gttatcgtat ccagattcag 1860 
aagaggaaat tttttcactg aagctaatgc cgcgttcttt tgcgacaccc ccggcattga 1920 
cctcattaac agtagagtct acgcgcggtt ttaaaaagcc tgacagaagg gcttttgtaa 1980 
tgaacgatgt ttcaagttta gcaattgtgc cttcatattg aatggcaaca tcctgtactg 2040 
gttctttcat gcactgtgat acaaggctgc caatttttcc tgcaatttga tggtaaggct 2100 
taattttagc aaattcatct tttgtcatgg caggcaggtt gatagctgac atgacaggca 2160 
ggccttttgc gaactgcaga acttcttctg acacttgggc ggcgacattg agctgtgctt 2220 
ctttcgttga tgctcccaag tgaggagtgg caatgactaa tggatgatca acaagtttgt 2280 
tgtcaactgg cggttcgact tcgaaaacgt caagcgctgc tcccgcaaca tgcccgtttt 2340 
ccaaagcttc gagaagtgct gcttcatcga taattccgcc tcgcgcacag ttaattaagc 2400 
gaacgccttt tttcgttttt gcaatcgttt ctttattcaa taagcctttt gtttcttttg 2460 
ttaaaggcgt gtgaacggta atgatatccg cactttcaag cacttcttca aatgtacggc 2520 
tgtttacgcc gatttttttc gctctttctt ccgttaagaa aggatcaaaa acgtgcacag 2580 
tcataccgaa cgctcctcga cgctgtgcaa tttcacttcc gattcggcct aatcctacaa 2640 
taccaagcgt ttttccataa agctctgaac cgacataagc tgtgcggttc cactctctgg 2700 
atttcactga gatattagcc tgcggaatgt gtctcattaa agaagagatc attgcaaatg 27 60 
tatgctcagc tgtcgaaatg gtgttgccgt tcggagcatt gatcacgatt accccgtgtt 2820 
tcgtagcctc atcaatatcg atattatcga caccgacacc ggctcttccg acaattttta 2880 
aagaagtcat tttgttgaaa aggtcttctg ttacttttgt cgcgcttcgc accaaaagag 2940 
catcaaaagt atgtaattca tcttctgcat ctgctacgtt tttttgaacg atttcaataa 3000 
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agtctgattc aataagtggc tgtaaaccgt cgrttgctcat tttgtqtgag accaat act c 3060 
gaaacatgtt ttctcctcct ctagagcgtc ctgctgttgt taagattatt ataccacacc 3120 
ttgtagataa agtcaacaac tttttgcaaa atttttcagg aattttagca gaggttgttc 3180 
tggatgtaga acaaaacatc tttccgctct tgtgctgtta ggatatcttt cttggaagct 3240 
aggtaggcct cgagttatgg cagttggtta aaaggaaaca aaaagaccgt tttcacacaa 3300 
aacggtcttt ttcgatttct ttttacagtc acagccactt ttgcaaaaac cggacagctt 3360 
catgccttat aactgctgtt tcggtcgaca agcttcgcga agcggccgca a^aattcactg 3420 
gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 34 80 
gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 3540 
tcccaacagt tgcgcagcct gaatggcgaa tggcgcctga tgcggtattt tctccttacg 3600 
catctgtgcg gtatttcaca ccgca;tatgg tgcactctca gtacaatctg ctctgatgcc 3660 
gcatagttaa gccagccccg acacccgcca acacccgctg actatgcttg taaaccgttt 3720 
tgtgaaaaaa tttttaaaat aaaaaagggg acctctaggg tccccaatta attagtaata 3780 
taatctatta aaggtcattc aaaaggtcat ecaccggatc agcttagtaa agccctcgct 3840 
agattttaat gcggatgttg cgattacttc gccaactatt gcgataacaa gaaaaagcca 3900 
gcctttcatg atatatctcc caatttgtgt agggcttatt atgcacgctt aaaaataata 3960 
aaagcagact tgacctgata gtttggctgt gagcaattat gtgcttagtg catctaacgc 4020 
ttgagttaag ccgcgccgcg aagcggcgtc ggcttgaacg aattgttaga cattatttgc 4080 
cgactacctt ggtgatctcg cctttcacgt agtggacaaa ttcttccaac tgatctgcgc 4140 
gcgaggccaa gcgatcttct tcttgtccaa gataagcctg tctagcttca agtatgacgg 4200 
gctgatactg ggccggcagg cgctccattg cccagtcggc agcgacatcc ttcggcgcga 4260 
ttttgccggt tactgcgctg taccaaatge gggacaacgt aagcactaca tttcgctcat 4320 
cgccagccca gtcgggcggc gagttccata gcgttaaggt ttcatttagc gcctcaaata 4380 
gatcctgttc aggaaccgga tcaaagagtt cct6cgccgc tggacctacc aaggcaacgc 444 0 
tatgttctct tgcttttgtc agcaagatag ccagatcaat gtcgatcgtg gctggctcga 4500 
agatacctgc aagaatgtca ttgcgctgec attctccaaa ttgcagttcg cgcttagctg 4560 
gataacgcca cggaatgatg tcgtcgtgca caacaatggt gacttctaca . gcgcggagaa 4620 
tctcgctctc tccaggggaa gccgaagttt ccaaaaggtc gttgatcaaa gctcgccgcg 4680 
ttgtttcatc aagccttacg gtcaccgtaa ccagcaaatc aatatcactg tgtggcttca 4740 
ggccgccatc cactgcggag ccgtacaaat gtadsggccag caacgtcggt tcgagatggc 4800 
gctcgatgac gccaactacc tctgatagtt gagtcgatac ttcggcgatc accgcttccc 4860 
tcatgatgtt taactttgtt ttagggcgac tgccctgctg cgtaacatcg ttgctgctcc 4920 
ataacatcaa acatcgaccc acggcgtaac gcgcttgctg cttggatgcc cgaggcatag 4980 
aGtgtacccc aaaaaaacag tcataacaag ccatgaaaac cgccactgcg ccgttaccac 5040 
cgctgcgttc ggtcaaggtt ctggaccagt tgcgtgagcg catacgctac ttgcattaca 5100 
gcttacgaac cgaacaggct tatgtccact gggttcgtgc cttcatccgt ttccacggtg' 5160 
tgcgtcaccc ggcaaccttg ggcagcagcg aagtcgaggc atttctgtcc tggctggcga 5220 
acgagcgcaa ggtttcggtc tccacgcaitc gtcaggcatt ggcggccttg ctgttcttct 5280 
acggcaaggt gctgtgcacg gatctgccct ggcttcagga gatcggaaga cctcggccgt 5340 
cgcggcgctt gccggtggtg ctgaccccgg atgaagtggt tcgcatcctc ggttttctgg 5400 
aaggcgagca tcgtttgttc gcccagcttc tgtatggaac gggcatgcgg atcagtgagg 54 60 
gtttgcaact gcgggtcaag gatctggatt tcgatcacgg cacgatcatc gtgcgggagg 5520 
gcaagggctc caaggatcgg gccttgatgt tacccgagag cttggcaccc agcctgcgcg 5580 
agcaggggaa ttgatccggt ggatgacctt ttgaatgacc tttaatagat tatattacta 5640 
attaattggg gaccctagag gtcccctttt ttattttaaa aattttttca caaaacggtt 5700 
tacaagcata acgggttttg ctgcccgcaa acg|rgctgtt ctggtgttgc tagtttgtta 5760 
tcagaatcgc agatccggct tcaggtttgc cggctgaaag cgctatttct tccagaattg 5820 
ccatgatttt ttccccacgg gaggcgtcac tggctcccgt gttgtcggca gctttgattc 5880 
gataagcagc atcgcctgtt tcaggctgtc tatgtgtgac tgttgagctg taacaagttg 5940 
tctcaggtgt tcaatttcat gttctagttg ctttgtttta ctggtttcac ctgttctatt 6000 
aggtgttaca tgctgttcat ctgttacatt gtcgatctgt tcatggtgaa cagctttaaa 6060 
tgcaccaaaa actcgtaaaa gctctgatgt atctatcttt tttacacegt tttcatctgt 6120 
gcatatggac agttttccct ttgatatcta acggtgaaca gttgttctac ttttgtttgt 6180 
tagtcttgat gcttcactga tagatacaag agccataaga acctcagatc cttccgtatt 6240 
tagccagtat gttctctagt gtggttcgtt gtttttgcgt gagccatgag aacgaaccat 6300 
tgagatcatg cttactttgc atgtcactca aaaattttgc ctcaaaactg gtgagctgaa 6360 
tttttgcagt taaagcatcg tgtagtgttt ttcttagtcc gttacgtagg taggaatctg 6420 
atgt:aatggt tgttggtatt ttgtcaGcat tcatttttat ctggttgttc tcaagttcgg 6480 
ttacgagatc catttgtcta tctagttcaa cttggaaaat caacgtatca gtcgggcggc 6540 
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ctcgcttatc 

tcaaaaccca; 

acttaaattc' 

cttttaiajiaa 

ccagattata. 

aaactaattc 

agcctttaac= 

tagctaatac 

ajagtgaacga 

ccacaca^ca 

gttcatttgc 

aatcactata 

ttgtgggtat 

cctctgtaaa 

tttatagaat 

actcactact 

Gtctacaaaa^ 

aatcgctgaa 

gacattcagt 

cgccttttat 

ttctcagggc 

cagttcctgc 

cagactggct 

gtcaac 



aaccaccaat 
ttggttaagc 
atcaaggcta 
ccactcataa 
ttttatgaat 
taatttttcg 
caaaggattc 
accataagca 
taccgtccgt 
taaaattagc 
tttgaaaaca 
ccaattgaga 
ctgtaaattc 
ttccgctaga 
aaagaaagaa 
ttagtcagtt 
cagaccttaa 
tattcctttt 
tcgctgcgct 
ggattcatgc 
gttttatggc 
cctctgattt 
aatgcaccca 



ttcatattgc 
cttittaaarct 
atctctaEtat 
atcctcatiag 
ttttttaact 
cttgagaact 
ctgatttpca 
ttttccctac 
tctttccttg 
ttggtttdat 
actaattdag 
tgggctagtc 
tgctagaicct 
cctttgtg^tg 
taaaaaaaga 
ccgcagtaitt 
aaccctaaag 
gtctccgacc 
cacggctctg 
aaggaaaicta 

gggtctgqta 

tccagtctga 
gtaaggca'gc 



tgtaagtgtt 
catggtagtt 
ttgccttjgtg 
agtatttigtt 
ggaaaagata 
tggcatagtt 
cagttctcgt 
tgatgttcat 
tagggttttc 
gctccgttaa 
acatacatct 
aatgataatt 
ttgctggaaa 
ttttttttgt 
taaaaagaat 
acaaaaggat 
gcttaagtag 
atcaggcacc 
gcagtgaatg 
cccataatac 
tgtggtgcta 
ccacttcgga 
ggtatcatca 



taaatcttta 
eittttcaagc 
agttttctfct 
ttcaaaagac 
aggcaatatc 
tgtccactgg 
catcagctct 
catctgagcg 
aatcgtgggg 
gtcatagcga 
caattggtct 
actagtcctt 
aicttgtaaat 
ttatattcaa 
agatcccagb 
gtcgcaaacg 
caccctcgca 
tgagtcgctg 
gg^ggtaaatg 
aagaaaagcc 
tctgactttt 
ttatcccgtjg. 
acaggcttaic 



cttattggtt 
atta.acatga 
t^tgttagtt 
ttaacat'gtt 
tcttcactaa 
aaaatctcaa 
ctggttgctt 
tattggttat 
ttgagtagtg 
ctaatcgcta 
aggtgatttt 
ttcctttgag 
tctgctagac 
gtggttataa 
cctgtgtat4 
ctgtttgctc 
agctcgggca 
tctttttcgt 
gcactacagg 
cgtcacgggc 
tgctgttcag 
acaggtcatt 
ccgtcttact 



6600 
6660 
6720 
16:^80 
6840 
6900 
6960 
.7020 
7080 
7140 
7200 
7260 
732Q 
7380 
.7440 
7500 
7i560 
7620 
7680 
7740 
7800 
7860 
7920 
7926 
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'<212> DNA 

<213> Artificial .Sequence 
,<220> • * 

<223> Description of Artificial Sequence :glyA o ver expires sion 
<223> plasmid 



<400> 25 
gaattttgcg 
ttagtgacat 
gccagtcatt 
aaccatcaca 
tattaatgaa 
ttaagttaaa 
gtataggtgt 
: ataaatcata 
tagatacacc 
cgtcgctatt; 
aaataaat gc \ 
tatcaatttc 
cttttctctt 
tttttatcta= 
ttttttaaaa 
•ccaattttcg; 
aatgtggtct 
tatcttgata. 
ggccatcbgt 
ctgcccgcca : 
tcctactcag: 
actgagcctt 
acactacbat 
accg<3gctac.\ 



gccgcttcga 
tagaaaaccg 
aggc.ctatct 
aacagaatga 
ttttcctgct 
cccagtaaat 
ttt gggaaac 
aaactctttg 
atc'aaaaatt 
gtaaccagtl: 
agggtaaaat 
tgtggttata 
ccaattgtct 
aagtgaattt 
gtcaatatta 
tttgttgaac 
ttt^tgtttt 
ataagggtaa 
caggatggcc 
ccctccgggc 
gagagcgttc 
tcgttttatt 
cggcgctacg 
tgccgccagg 



aagctgtaat 
actgtaaaaa 
gacaattcct 
tgtacctgta 
gtaataatgg 
gaagtccatg 
aatttccccg 
aagtcatt.ct 
gtataaagtig 
ctaaaagctg 
ttatatcctt 
ctaaaagtcg 
aaatcaattt 
agfgaggctta 
ctgtaaca|ta 
taatgggt^G 
tttaaaggat 
ctattgaaitt' 
ttctgcttaa 
pgttgcttcg 
accgacaakc 
tgatgcctgg 
:^.cc(tttcact 
caaattctgt 



ataaaaacbt 

gtacagtcigg 

gaatagagtt 

aagatagcgg 

gtagaaggta 

gaataataga 

aaccattata 

ttacaggagt 

gctctaactt, 

tatttgagtt 

cttgttttat 

tttgttggtt 

tattaaagtt 

cttgtctgct 

aatatatatt 

tttagttgaa^ 

ttgagcgftag 

cggtaccaag 

tttgatgcGt 

caacgttcaa 

aacagataaa 

cagttccjDiba 

tctgagt^teg 

tttatcagac 



tcttcaacta 
cattatctca 
cataaacaat 
taaatatatt 
attactatta 
aagagaaaaa 
tttctctaca 
ccaaatacca 
aitbccaataa 
ta:tcaccGtt 
gtttcggtat 
caaataatga 
catttgatat 
tticttcatta 
ttaaaaatat 
gaataaaga^ 
cgaaaaatcb 
agtttgtaga 
ggcagtttat 
atdcgctccc 
acgaaaggcc 
Gtictcgcatg 
gcatggggt'G 
cgpttctgcg. 



acggggcagg 
tattataaaa 
cctgcatgat 
gaattacctt 
ttattgatat 
gcattttcag 
tqagaaaggt 
gagaatgttt 
cctaactctc 
gtcactaaga 
aaaacactaa 
ttaaatatct 
gcctcctaaa 
gaatcaatcc 
cccactttat 
cacattaaaa 
ttttctttct 
aacgcaaaaa 
ggcgggcgtc 
ggcggatttg 
cagtctttcg 
gggagacQCG 
aggtgggacc 
ttctgattta 



60 

120 

180 

240 

300 

:360 

420 

480 

540 

.600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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atctgtatca! ggctgaaaat cttctctcat ccgccaaaac aggatccccc latcaacaatt 1500 
acacacttct attgattcta caaaaaaaga cattgagttt. caagaac^tc gtcaaaaaac 1560 
ccgccgggca taagcpcaag cgggttttag gatcttaata atctaattct ttatataaag 1620 
gaaatttatc agtcagagca gctacacgdt gtcttgcttc ttcaagtttt ccttcatctt 1680 
cgtggttttt caatgcaagc gcaatgatag caccgacttc ttctaatgcg tctccgtcaa 1740 
aaccgcggct ggttacagca gctgtaccaa gacggatgcc . gcttgttacg aaaggttttt 1800 
cag^ateata tggaatcgcg ttttt^tag acgtaatacc aatttcatca iagtacatgct 1860 
.ccgcaacctt accagtcagt ccgagcgaac gaaggtcaac aaggataagg tggttgtctg 1920 
ttccgcctga aacgagctgg atgccctctt tcgttaaggc ttcagccaga cgtttcgcgt 1980 
ttgaaatgac gttttgtgca tatgttttga aatcgtcctg caatacttca ccgaatgaaa 2040 
cagcttttgc ggcaataacg tgcatcagag ggccgccttg aattccaggg aagatcgatt 2100 
tatcaatttt cttgccaaac tcttcacggc aaaggatcat accgccgcga ggaccgcgaa 2160 
gtgttttatg tgttgttgtt gtaacgaaat cagcgtaagg aaccgggttt ggatgaaggc 2220 
ctgccgcaac aagtcctgcg atatgtgcca tatccaccat gaagtaagcg: ccgacttcat 2280 
cagcaatttc apggaatttc ttaaagtcga ttgtacgagg atacgcactt gctcctgcta 2340 
cgataagctt cggtttatga gcgagggctt ttfdacgcac gtcatcgtaa tcaatatatt 2400 
gagtttcttt atctacgccg tactcaacaa agttatattg aacaccgctg aagttgactg 2460 
ggcttccgtg tgttaaatgg ccgccgtggg agaggttcat cccaagtaca gtatcgcctt 2520 
gctccaaaat cgtgaagtac actgccatgt ttgcttgtgc gcctgaatga ggctgaacgt 2580 
ttajcatgctc cgetccaaag atttccttcg cgcggtcacg ggcgatatct tcaacgacat 2640 
cgacgtgctc gqatccgccg tagtagcgtt tgcccggata tccttctgcg tacttatttg 2700 
tcaaaacaga tccttgtigct; tccataaccg cttcacttac aaagttctca :gaagcaatca 2760 
attcgatctt agtctigttgg icgttcacgct catttttaat ggcgttaaac acttgttcgt 2820 
cttgcgpagg taaatgtttc atgtttacac ctcGtctaga gcgtcctgct gttgttaaga 2880 
ttattatacc acaccttgta gataaagtca acaacttttt gcaaaatttt tcaggaattt 2940 
tagcagaggt tgttctggat gtagaacaaa acatctttcc gdtcttgtgc tgttaggata 3000 
tctttcttgg. aagctaggta ggcctcgagt tatggcagtt ggttaaaagg aaacaaaaag 3060 
accgttttea cacaaaacgg tctttttcga tttcttttta cagtcacagc cacttttgca 3120 
aaaaccggaC; agcttcatgc cttataactg ctgtttcggt cgacaagctt cgcgaagcgg 3180 
ccgcaaaatt cactggccgt cgtttfacaa cgtcgtgact gggaaaaccc tggcgttaqc 3240 
caacttaatc gccttgcagb acatccccct ttcgccagct ggcgtaatag cgaagaggcc 3300 
cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg 3360 
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca 3420 
atctgctctg atgccgcatia gttaagccag ccqcgacacc cgccaacacc cgctgactat 34 80 
gcttgtaaac cgttttgtga aaaaattttt aaaataaaaa aggggacctc tagggtcccc 3540 
: - aattaattag taatataatc^ tattaaaggt cattcaaaag gtcatccacc ggatcagctt 3600 
agtaaagccc tcgctagatt ttaatgcgga tgttgcgatt acttcgccaa ctattgcgat 3660 
aacaagaaaa agccagcctt tcatgatata tctcccaatt tgtgtagggc ttattatgca 3720 
cgcttaaaaa. taataaaagc agacttgacc tgatagtttg gctgtgagca attatgtgct 3780 
j tagtgcatct aacgcttgag ttaagccgcg ccgcgaagcg gcgtcggctt gaacgaattg 3840 

ttagacatta tttgccgact: accttggtga tctpgccttt cacgtagtgg acaaattctt 3900 
I ccaactgatc tgcgcgcgag gccaacfcgat cttcttcttg* tccaagataa gcctgtctag 3960 

cttcaagtat gac^gjgctga tactgggccg gcaggcgctc cattgcccag tcggcatrcga 4020 
catccttcgg cgcgaHitttg ccggttactg cgctgtacca aa^ 4080 
ctacatttcg ctcatcgcca gccca^cgg : gcggcgagtt ccatagcgtt aaggtttcat 4140 

. : ttagcgcctc aaatagatcc tgttcaggaa ccggatcaaa gagttcctcc gccgctggac 4200 
ctaccaaggc aacgctatgt tctcttgctt ttgtcagcaa gatagccaga tcaatgtcga 4260 
tcgtggctgg ctcgaagata cctgcaagaa tgtcattgcg. 4320 
: gttcgcgctt agctggataa cgccacggaa tgatgtcgtc gtgcacaaca atggtgactt 4380 

: ; ctacagcgqg gagaatctcg ctctctecag . gggaagccga agtttccaaa aggtcgttga 4440 
tcaiaagctcg cjcgcgttgtt tcatcaagcc ttac^ aaatcaatat 4500 

; . cactgtgtgg cttcaggccg -ccatcbactg cggagccgta caaatg^ 4560 

: • tcggttcgag atggcgctcg atgacgccaa ctacctctga tagttgagtc gatacttcgg 4 620 
cgatcaccge ttccctcatg atgttjiaact ttg^ tgctgcgtaa 4 68d 

catcgttgct gctccataac atcaakcatc gacccacggrc g^aacgcgct tgctgcttgg 4740 
at^cccgagg catagactgt accccaaaaa aacagtcata acaiagccatg aaaaccgcca 4800 

: : , ctgcgccgtt accaccgctg cgttcggtca aggttctgga ccagttgcgt gagcgcatac 4860 
gctacttgca t1:acagctta cgaacbg^ aggcttatg't ccactgggtt cgtgccttca 4920 
/ tcGgtttcca cggtg^tgcgt cacccggcaa ccttgggcag cagcgaagtp gaggcatttc 4980 
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tigtcctggct 
ccttgctgtt 
gaagacctcg 
tcctcggttt 
tgcggatcag 
tcatcgtgcg 
cacccagfcct 
tagattatat 
tttcacaaaa 
gttgctagtt 
tttcttccag 
cggcagcttt 
agctgtaaca 
ttcacctgtt 
gtgaacagct 
accgttttca 
tctactttt^ 
agatccttcc 
atgagaacga 
aactggtgag 
g1:aggtagga 
tgttctcaag 
tatcagtcgg 
ctttacttafc 
caagcattaa 
tcttttgtgt 
aagacttaac 
atatctcttc 
actggaaaat 
gctctctggt 
gagcgtattg 
tggggttgag 
agcgactaat 
ggtctaggtg 
tccttttcct 
taaattctgc 
ttcaagtggt 
ccagccctgt 
aaacgctgtt 
tcgcaagctc 
cgctgtcttt 
aaatggcact 
aagcccgtca 
ctttttgctg 
ccgtgacagg 
cttacccgtc 



ggcgaacgag 

qttctacggc 

gccgtcgcgg 

tctggaaggc 

tgagggtttg 

ggagggcaag 

gcgcgagcag 

tactaattaa 

cggtttacaa 

tgttatcaga 

aattgccatg 

gattcgataa 

agttgtctca 

ctattaggtg 

ttaaatgcac 

tctgtgcata 

tttgttagtc 

gtatttagcc 

accattgaga 

ctgaattttt 

atctgatgta 

ttcggttacg 

gcggcctcgc 

tggtttcaaa 

catgaactta 

tagttctttt 

atgttccaga 

actaaaaact 

ctcaaagcct 

tgctttagct 

gttataagtg: 

tagtgccaca; 

cgctagttca' 

attttaatca 

ttgagttgtg 

tagaccctct 

tataatttat 

gtataactca 

tgctcctcta 

gggcaaatcg 

ttcgtgacat 

acaggcgcct 

cgggcttctc 

ttcag;cagtt. 
tcattcagac 
ttactgtcaa 



cgcaaggttt 

aaggtgctgt 

cgcttgccgg 

gagcatcgtt 

caactgcggg 

ggctccaagg 

gggaattgat 

ttggggaccc 

gcataacggg 

atdgcagatc 

attttttccc 

gcagcatcgc 

ggtgttcaat 

ttacatgctg 

caaaaaclicg 

tcfgacagttt 

ttgatgcttc 

agtatgttct 

tcatgcttac 

gcagttaaag 

atggttgttg 

agatccattt 

ttatcaacca 

acccattggt 

aattcatcaa 

aataaccact 

ttatatttta 

aattctaatt 

ttaaccaaag : 

aatacaccat 

aacgataccg 

cagcataaaa 

tttgctttga 

ctataccaat 

ggtatctgta 

gtaaattccg 

agaataaaga 

ctactttagt • 

caa.aacagac 

ctgaatattc 

tcagttcgct 

tttatggatt 

agggcgtttt 

cctgccctct 

tggctaatgc 



cggtctccac 

gcacggatct 

t99tgctgac 

tgttcgccca 

tcaaggatct 

ajicgggcctt 

ccggtggatg 

tagaggtccc 

ttttgctgcc 

cggcttcagg 

cacgggaggc 

ctgtttcagg 

ttcatgttct 

ttcatctgtt 

taaaagcitct 

tccctttgat 

actgatagat 

ctagtgtggt 

tttgcatgtc 

catcgtgtag 

gtattttgtc 

gtctatctag 

ccaatttcat 

taagcctttt 

ggctaatctc 

cataaatGct 

tgaatttttt 

tttcgcttga 

gattcctgat . 

aagcattttc 

tccgttcttt 

ttagcttggt 

aaacaactaa 

tgagatgggc 

aattctgcta 

ctagaccttt 

aagaataaaa 

cagttccgca 

cttaaaaccc 

cttttgtctc 

gcgctcaqgg 

catgcaagga 

aitggcgggtc 

igattttccag 

acccagtatag 



gcatcgtcag 
gccctggctt 
cccggatgaa 
gcttctgtat 
ggatttcgat 
gatgttaccc 
accttttgaa 
cttttttatt 
cgcaaacggg 
tttgccggct 
gtcactggct 
ctgtctatgt 
agttgctttg 
acattgtcga 
gatgtatcta 
atctaacggt 
acaagagcca 
tcgttgtttt 
actcaaaaat 
tgtttttctt 
accattcatt 
ttcaacttgg 
attgctgtaa 
aaactcatgg 
tatatttgcc 
catagagtat 
taactggaaa 
gaacttggca 
ttceacagtt 
cctactgatg 
ccttgtaggg 
ttcatgctcc 
ttcagacata 
tagtcaatga 
gacctttgct 
gtgtgttttt 
aaagataaaa 
gtattacaaa 
taaiaggctta 
cgaccatcag 
ctctggcagt 
aactacccat 
tgctatgtgg 
tctgaccact 
gcagcggtat 



gcattggcgg 
caggagatcg 
gtggttcgca 
ggaacgggca 
cacggcacga 
gagagcttgg 
tgacctttaa 
ttaaaaattt 
ctgttctggt 
gaaagcgcta 
cccgtgttgt 
gtgactgttg 
ttttactggt 
tctgttcatg 
tcttttttac 
gaacagttgt 
taagaacctc 
tgcgtgagcc 
tttgcctcaa 
agtccgttac 
tttatctggt 
aaaatcaacg 
gtgtttaaat 
tagttatttt 
ttgtgagttt 
ttgttttcaa 
agataaggca 
tagtttgtcc 
ctcgtcatca 
ttcatcatct 
ttttcaatcg 
gttaagtcat 
catctcaatt 
taattactag 
ggaaaacttg 
tttgtttata 
agaatagatc 
aggatgtcgc 
agtagcaccc 
gcacctgaigt 
gaatgggggt 
aatacaagaa 
tgctatctga 
tcggattatc 
catcaacagg 



5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

55:20 

5580 

5640 

5700 

57 60 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600. 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7701 



<210> 26 

<211> 3888 . . 

<212> Dm 

<213> Artificial Sequence 

■<220> •'■ : 

<223> plasmid . ■ 



<400> 26 

tgcgccgcta cagggcgcgt; ccattcgcca 
tcggtgcggg CGtct»tcg;Gt attacgccag 
agc^gfcagat tggtggactt aacaaattat 



ttcaggctgc gcaactgt'tg gga^g99P?«^ ^0 . 
tttg^gggtg agttcatgaa gtttcgtcgc 120 
ttgttaaccc atccgcacga gttaataccg 180 
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ctaacctttt tctctgagcg gtatgaatct .gcaaaatekt cgatcagtjga agatttaaca 240 
attattaaac; aaacctttga acagcagggig attggtadtt tgcttactgt tcccggagct 300 
gcctfgaggcg ttaaatatat tccgaaa^tg aagcsiggctg aagctgaa<ga gtttgtgcag 360 
acacttggac agtcgctggc iaaatcctgag cgtatccttc cgggcggbta tgtatattta 420 
acggatatct taggaaagcc :atctgtactc tccaaggtag ggaagctgtt tgcttccgtg 480 
tttgcagjagc gcgaaattga tgttgtcatg accgttgcca pgaaaggcat ccctcttgcg 540 
tacgcagctg caagcibattt ;gaatgtgcct gttgtgatcg jttcgtaaaga caataaggta 600 
acagagggct ccacagtcag I cattaattac gtttcaggct t::ctcaaaqcg cattcaaaca 660 
atgtcacttg cgaaaagaag i catgaaaacg ggttcaaacg fcactcattat tgatgacttt 720 
atgaaagcag g.cggcaccat ' taatggtatg attaacctgt tggatgagtt taacgcaaat 780 
gtggcgggaa tcggcgtctit agttgaagcc gaaggagtag atgaacgtct tgttgacgaa- 840 
tatatgtcac ttcttactct ttcaaccatic aacatgaaag agaagtcciat tgaaattcag 900 
aatggcaati: ttctgcgttt ttttaaagac aatcttttaa agaatgga:ga gacagaatca 960 
tgacaaaagc agtccacaca aaacatgccc cagcggcaat cgggcctt^t tcacaaggga 1020 
ttatcgtcaa caatatgttt tacagctcag gccaaatccc tttgactdct tcaggcgaaa 1080 
tggrtgaatgg cgatattaag gagcagactc atcaagtatt cagcaattta aaggcggttc 1140 
tggaagaagc gggtgcttct tttgaaacag ttgtaaaagc kactgtattt atcgcggata 1200 
tggaacagtt tgcggaagta aacgaagtgt acggacaata ttttgacact cacaaaccgg 1260 
cgagatcttg tgttgaagtc gcgagactcc cgaaggatgc gttagtcc/ag atcgaagtta 1320 
ttgcactggt gaaataataa gaaaagtgat tctgggagag CGgggatcjac ttttttattt 1380 
accttatgcc cgaaatgaaa gctttatgae cctgcattaa tgaatcgg^cc aacgcgcggg 1440 
gagaggcggt ttgcgtattg, ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 1500 
ggtcgt1;cgg ctgcggcgag cggtatcagc tcaictcaaag gcggtaatac ggttatccac 1560 
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcstaa aggccaggaa 1620 
ccgtaaaaag gccgcgttgc tggcgttttt cgataggctc bgcccccctg acgagcatca 1680 
eaaaaatcga cgctcaagtc agaggtggcg aaacccgaca iggactataaa gataccaggc 1740 
gtttccccct ggaagctccc tcgtgcgctc tcctgttGcg^ccctgccgc ttaccggata. 1800 
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct patagcfccac gctgtaggta 1860 
tctcagttcg -gtgtaggtcg ttcgctccaa gctgggctgt igtgcacgaac cccccgttca 1920 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag jtccaacccgg taagacacga 1980 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 2040 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac kctagaagga cagtatttgg 2100 
tatctgcgct ctgctgaagc : cagttacctt cggaaaaaga gttggtagct cttgatccgg 2160 
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 2220 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 2280 
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 2340 
ccttttaaat taaaaatgaa gttttaaatc aatctaaa^t atatatgaigt aaacttggtc 2400 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 2460 
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 2520 
tggcccbagt gctgicaatga taccgcgaga cccacgctca iccggctcqag atttatcagc 2580 
aataaaccag ccagccggaa gggccgagcg cagaagtggt bctgcaactt tatccgcctc 2640 
catccagtct attaattgtt gccgggaagc tagagtaagt ..agttcgccag ttaatagttt 2700 
gcgcaacgtt gttggcattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 27;60 

. ttcattcagc tccgrgttccc aacgiatcaag ; gcgagttaca ^gatccccca tgttgtgcaa 2820 • 
aaaagcggtt agctccttcg gtcctccgat cgttgtcaga iagtaagttgg ccgcagtgtt 2880 
atcactcatg gttatggcag cactgcataa ttctcttact igtcatgcdat ccgtaagatg 2940 
cttttctgtg actggtgagt actcaaccaa gtcattctga igaataccgcg cccggcgacc 3000 
gagttgctct tgcccggcgt caatacggga taatagtgta tgacatagca gaactttaaa 3060 
agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 31^20 
gagatcpagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 3180 

.caccagcgtt tctggjgtgag Gaaaaacagg aaggcaaaat gccgcaaa^a agggaataag 3240 . 
ggcgacacgg aaatgttgaa tactcatact cttcctttitt caatattatt gaagcattta 3300 
tcagggittat tgtctcatga gcggatabat atttgaatgt jatttagadaa ataaacaaat 3360 
aggggttccg cgcacatttc cccgaaaagt gccacctgta itgcggtgtga aataccgcac 34;20 
agatgcgtaa ggagaaaata; ccgcatcagg cgaaattgta aacgttaata ttttgttaaa 34;80 
attcgcgtta aatatttgtt aaatcagctc attttttaac caataggpcg aaatcggcaa 35:40 
aatpccttat aaatc^aaagi.aatagacpga gatag^ cagtttggaa 3600 

caagagtcca ctattaaaga' acgtggactc caacgtcaaa gggcgaaaaa ccgt ct at ca; 36:60. 
gggcgatggc ccactacgtg. aaccatcacc c.a^atcaagt -tttttgcgiigt cgaggtgccg 3720 



Wb 2004/005527 



P|CT/US2002/021336: 



taaagctcta aatcggaacc ctaaag:ggaig cccccgattt agagcttgac ggggaaagcc 3780 v 
. iggcgaacgtg gcgagaaagg; aagggaa^aia ag.cgaaagga gcgggcgcta gggcgctggc 3840 
aagtgtagcg gtcacgctgc gcgtaaccac cac.acccgcc gcgcttaa : 3888! i . 

<210> 27 . ! !. 

<211> 4606 ' I ' 

<212> DNA . . . . • ' - . 

<213> Artificial Sequence . 

<220>: - : 

<223> Description of Artificial Sequence: pi a smid 

<400> 27 , . - . 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga J2.0 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctcc cggccgccat 240 
ggccgcggga tgcggccgcg tcgacgtgaa ataccgcaca gatgcgtaag gagaaaatac 300 
cgcatcaggc gataaaccca gcgaaccatt tgaggtgata ggtaagatta taccgaggta 360 
tgaaaacgag aattggacct ttacagaatt actctatgaa gcgccatatt taaaaageta 420 . 
ccaagacgaa gaggatgaag aggatgagga ggcagattgc cttgaatata ttgacaatac 480 '.. 
tgataagata atatatcttt tatatagaag atatcgccgt atgtaaggat ttpagggggc 540- • 
aaggcatagg cagcgcgctt atcaatatat ctatagaatg ggcaaagcat aaaaacttgc 600 
atggactaat gcttgaaacc caggacaata accttatagc ttgtaaattc tatcataatt 660 
gtggtttcaa aatcggctcc gtcgatacta tgttatacgc caactttcaa aacaactttg 720 
aaaaagctgt tttctggtat ttaaggtttt agaatgcaag gaacagtgaa ttggagttcg 780 ^ 
tcttgttata attagcttct tggggtatct ttaaatactg tagaaaagag gaaggaaata 840 
ataaatggct aaaatgagaa tatcaccgga attgaaaaaa ctgatcgaaa aataccgctg 900 
cgtaaaagat acggaaggaa tgtctcctgc taaggtatat . aagctggtgg gagaaaatga 960 
.aaacctatat' ttaaaaatga cggacagceg gtataaaggg accacctatg atgtggaacg 1020, 
ggaaaaggac atgatgctat ggctggaagg aaagctgcct gttccaaagg tcctgcactt 1080 
tgaacggcat gatggctgga gcaatctgct catgagtgag gccgatggcg tcctttgctc 114 0 
ggaagagtat gaagatgaac aaagccctga aaagattatc gagctgtatg cggagtgcat 1200 
.caggctcttt cactccatcg acatatcgga ttgtccctat acgaatagct tagacagccg 1260 ■ 
cttagccgaa ttggattact tactgaata^ cgatctggcc gatgtggatt gcgaaaactg 1320 
ggaagaagac actccattta aagatccgcg cgagctgtat gattttttaa agacggaaaa 1380 ... 
gcccgaagag gaacttgtct tttcccacgjg cgacctggga gacagcaaca tctttgtgaa 14 40, 
lagatggcaaa gtaagtggct ttattgatct tgggagaagc ggcagggcgg acaagtggta 1500 ' 
■tgabattgcc ttctgcgtcc ggtcgatcag ggaggatatc ggggaagaac agtatgtcga 1560 
gctatttttt gacttactgg ggatcaagcc tgattgggag aaaataaaat attatatttt 1620 
actggatgaa ttgttttagt acctag^attt agatgtctaa aaagctttaa ctacaagctt 1680 
tttagacatc taatcttttc tgaagtacat ccgcaactgt ccatactctg atgtttt;ata 1740 
:tcttttctaa aagttcgcta gataggggtc ccgagcgcct acgaggaatt tgtatcgcca 1800. 
ttcgccattc aggctgcgca actgttggga agggcgatcg gtgcgggtac . cgggatcact 1860 
agtgcggccg cctgcaggtc gaccatatgg gagagctccc aacgcgttgg atgcatagct 1920 
•tgagtattct atagtgtca'c ctaaat'agct tggcgtaatc atggtcatag ctgtttqctg 1980 
tgtgaaattg ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta 2040 
aagcctgggg tgcctaatga gtgagptaac tcacattaat tgcgttgcgc tcactgdccg 2100 
jctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgg'gga 2160 
gaggcggttt gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgcticgg 2220 
tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaataicgg ttatcca:cag 2280 
iaatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 2340 
jgtaaaaaggc cgcgttgctg gcgtttttcg ataggctccg cccccctgac gagcatcaca 2400' 
aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 24 60' 
ttqpccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 2.520. 
tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggdatc 2580 
:*^cagttcggt gtaggtcgtt; cgctccaagp gcacgaaccc cccgttcagc 2640." 

iccgaccgctg cgccttatcc! ggta&ctatc gtcttgagtc caacccggta agacacg:aGt 2700 • 
tategceapt ggeagcagcc: actggfcaac^ ggattageag agcgaggtat gtaggcg^tg 2760. ^ 
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. ctacagagtt cttgaagtgg tggcl^t^iacit acggctacac tagaaggapa gtatttggta 2820 
tdtgcgctct g.ctgaagcda gttaccttcg gaaaaagagt tggtagctct tgatccggca 2880 
aacaaaccac cgctg^tagc ggtggttttfe ttgtttgcaa gcagcagatt acgcgcagaa 2940 
aaaaaggatc tcaajgaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 3000 
aaaactcacg ttaatrggatt ttggtcatgia gattatcaaa aaggatcttc acctagatcc 3060 
ttttaaatta aaaatga^gt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 3120 
acagttacca atgcttaatc agtgaggcap ctajtctcagc gatqtgtcta tttcgttcat 3180 
ccatagttgc ctgactcccc gtcgjtgtagia taactacgat acgggagggc ttaccatctg 3240 
gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 3300 
taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca 3360 
tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 3420 
gcaacgttgt tggcatt^ct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 3480 
cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 3540 
aagcggttag ctccttcggt cctccgatc'g ttgtcagaag taagttggcc gcagtgttat 3600 
cactcatggt tatggcagca ctgcata^tt ctcttactgt catgccatcc gtaagatgct 3660 
tttctgtgac tggtgagtac tcaaccaagt cattctgaga ataccgcgcc cggcgaccga 3720 
gttgctcttg cccggcgtca atacgggata atagtgtatg acatagcaga actttaaaag 3780 
tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga 3840 
gatccagttc gatgtaaccc actcgtgcab ccaactgatc ttcagcatct tttactttca 3900 . 
ccagcgtttc tgggtgagca aaaacaggak ggcaaaatgc cgcaaaaaag ggaataaggg 3960 
cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc 4020 
agggttattg tctc^tgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 4080 
gggttccgcg cacatttccc cgaaaagtgc cacctgtatg cggtgtgaaa taccgcacag 4140 
atgcgtaagg agaaaatacc gcatcaggcg aaattgtaaa cgttaatatt ttgttaaaat 42Q0 
tcgcgttaaa tatttgttaa atcagctcat tttttaacca ataggccgaa atcggcaaaa 4260 
tcccttataa atcaaaagaa tagaccgaga tagggttgag tgttgttcca gtttggaaca 4320 
agagtccact attaaagaac gtggactcca acgtcaaagg gcgaaaaacc gtctatcagg 4380 ^ ^ 
gcgatggccc actacgtgaa ccatbaccqa aateaagttt tttigcggtcg aggtgccgta 4440 
.aagctctaaa tcggaaccct aaagggagcc cccgatttag agcttgacgg ggaaagccgg 4500 
cgaacgtggc gagaaaggaa. gggaagaaag cgaaaggagc gggcgctagg gcgctggcaa 4560 
-gtgtagcggt cacgctgcgc gtaaccaGck caeccgccgc gcttaa ; ^ 4606 

<210> 28 

<211> 5399 : . ! 
<212> DRA 

<213> Artificial j Sequence > 
•<220> ■ • 

<223> Description of Artificial; Sequence :plasmid 
<400> 28 : . i ' . ! 

tgcgccgcta cagggcgcgt ccattcgcQ.a ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag tttgggggtg agttcatgaa gtttcgtcgc 120 
agcggcagat tggtggactt: aacaiaattat ttgttaaccc atccgcacga gttaataccg 180 
ctaacctttt tctctgagcg gtatjgaatct gcaaaatcat cgatcagtga agatttaaca 240 
attattaaac aaacctttga acagbagggg attggtactt tgcttactgt tcccggagct 300 
gccggaggcg ttaaatai:at tccgaaaatg aagcaggctg aagctgaaga gtttgtgcag 360 
acacttggac agtcgctggc aaatbct<ga|g cgtatccttc cgggcggtta tgtatattta 420 
acggatatct taggaaagcc atctgtactp tccaaggtag ggaagctgtt tgcttccgtg 480 
tttgcagagc gcga-aattga tgttgtcatig acegttgcca cgaaaggcat ccctcttgcg 540 . 
tacgcagctg cggccgcgtc gacaaaccG|a gtgaaccatt tgaggtgata ggtaagatta 600 
taccgaggta tgaaaacgag aatt^gacet ttacagaatt actctatgaa gcgccatatt 660 
taaaaagcta ccaagacgaa ■ gaggatgaag aggatgagga ggcagattgc cttgaatata 720 
ttgacaatac tgataagata ktatiatctt-t tatatagaag atatcgcogt atgtaaggat 780 
ttcagggggc aaggcatagg cagbgcgct|t atcaatatat ctatagaatg ggcaaagcat 840 
aaaaacttgc atggactaat gcttgaaacc caggacaata accttatagc ttgtaaattc 900 
tatcataatt gtggtttcaa aajtcjggctGC gtegatacta tgttatacgc caactttcaa 960 
aacaactttg aaaaagctgt. ttfec-tggitajt ttaaggtttt agaatigcaag gaacagtgaa 1020 
ttggagttcg tcttgttata attagcttcfc tggggtatct ttaaatactg tagaaaagag 1080 
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gaaggasiata- 
aataccgciig; 



atgtgtfaacg 
t Gctg.cact t 
tcctttgctc 
cggagtgcat 
tagacagccg 
gcgaaaact'g 
. agacggaaka 
tctttgtgka 
acaagtggta 
a^tatgtrcga 
attatatttt 
Ctapaagctt. 
atgttttata 
tgtatc^eca 
acaataaggt 
gcattcaaac 
ttgatgactt 

. ttaacgcaaa 
.ttgtt'gja.cga 
ttgaaattca 
aigac^gaatc 
ttcacaaggg 
;ttcagigcgaa 
.Waaggcggtt 
tatcgcggat 
tcacaaaccg 
gatcgaagtt 
pttttttatt 
caacgcgcgg 
tcgctgegGt 
:cggttatcca 
aaggccagga 
g.acgagcatc 
agataccagg 
cttaccggat 

• cgctgtag.gt 
ccccccgttc 

. gtaagacacg 
tatgtaggQg 
acagtatttg 
tcttgatccg 
attacgcgca 
gctcagtgga 
ticacctaga 
taaacttggt 
ctatttcgtt 
ggcttaccat 
gatttatcag 
ttatccgcct 
gttaatagtt' 
tttggtatgg 
aitgttgtgca 

: gccgcagtgt 
tccgtaagat 
gcccggcgac 
agaacttti^a 



ataaatg^ct 
cgtaajkagat 
aaacctatat 
ggaaaaggac 
tgaacggcat 
ggaagagtat 
caggctcttt 
cttagccgaa 
ggaagaagac 
gcccgaagag 
agatggcaaa 
tgacattgcc 
gctatttttt 
=actggatgaa 
ittagacatc 
tcttttctaa 
ggtaccagct 
aacagagggc 
aatgtpabtt 
tatgaaagca 
tgtgigGggga 
atataligtca 
gaatggcaat. 
atgacaaaag 
attatcgtca 
atcfgtgaatg 
Gtggaagaag 
atggaacagt 
gcgagatctt 
attgcactgg 
tabcttatgb 
ggagaggcgg 
cg^tcgttcg 
cagaatcagg 
accgtaaaaa 
acaaaaatcg 
Ggtttccccc 
acctgtccgc 
atctcagttc 
agcccigaccg 
acttafccgcc 
gtgctaciaga 
gtatCitgcgc 
gcaaaca^.ac 
gaikaaaaagg . 
acgaajaactc 
tcbttttaaa 
ctgacagtta 
catccatagt 
ctggccccag 
caataaacca 
ccatccaigtc 
tgcgcaapgt: 
cttcattcag 
aaaaagcggt 
tatcactcat 
gqttt&ctgt 
pgagt^gctc 
aaigtgctbat 



aaaatgagaa 
acggaaggaa 

: ttaa:aaatga 
atgai^gctat 
gatggctgga 
gaagatgaac 
cactccatcg 
ttggattact 
acitccattta 
gaacttgrtct 
gtaagtggct: 
ttctgcg-tcc 
gacttactgg. 
ttgttttagt. 
taatcttttc 
aagttcgctai 
gcaagctatt 
tccacagtca 
gcgaaaagaa 
ggcggcacca 
atGggcgtc|t 
cttcttactb. 
tttctgcgtt 
cagtccacac 
acaatatgtt 
gGgatattak 
cgggtgcttpr 
ttgcggaagjb- 
gtgttgaagt. 
tgaaataata' 
GGgaaatga^; 
tttgcgtatt 
gGtgGggcga : 
ggataaGgGa . 
ggcGgGgttg 
acgctca^gt 
tggaagctce 

CtttGtcCGt 

ggtgtaggtc 
ctgGgGGtta 
actggGagGa 
gttGttgaag 
tctgctgaag; 
caccgctggt 
atGtGaagaa" 
acgttaaggg 
ttaaaaatga . 
GGaatgctta; 
tgbctgaGtG: 
tgctgcaatg; 
gccagccgga;. 
tattaattgt 
tgttggcatt: 
GtGcggttGb 
tagctGcttc 
ggttatggca 
gactggtgag 
ttJgcccggGg 
cattggaaaa 



taticacbgga 
. tgtctcctg.c 
: cggacagccjg 
^gctggaagg 
: gcaatGtgGt 
• aaagccctga 
acatatbggia 
tactgaataa 
aagatccgcg 
tttcbcaGg'g 
ttattgatGt 
ggtcgatcag 
ggatcaagbc 
acctagattt 
■ tgaagtacat 
gataggggtc 
tgaatgtgcc 
gcattaatta 
gcatgaaaaG 
ttaatggtat 
tagttgaagc 
tttcaaccat 
tttttaaaga 
aaaacatgGC 
ttacagGtca 
g^agGagact 
ttttgaaaca 
aaacgaagtg 
cgcgagactc 
agaaaagtga 
agctttatga 
gcrgcgctctt 
gcggtatcag 
ggaaagaaGa 
ctggcgtttt ; 
cagaggtggc 
btcgtgcgct 
tcgggaagcg 
gttcgctcca 
tGcggtaact 
gcGaGtggta 
t^gtggGGta 
ccagttacct 
agcggtggtt 
gatcctttga 
attttggtca 
agttttaaat 
atcagtgagg 
GCGgtcgtgt 
ataccgcga;g 
agggccgagc 
tgGcgggaag 
gctacaggca 
caacgatGaa 
ggtcctGcga 
gbactgcata 
tactcaacba 
tcaatacggg 
cgttcttcgg 



attgaaaaaa 
taaggtatat 
gtataaaggg 
aaagbtgcct 
catgagtgag 
aaagattatc 
ttgtcGctat 
cgatGtggcc 
cgagctgtat 
cgacGtggga 
tgggagaagG 
ggaggatatc 
tgattgggag 
agatgtbtaa 
ccgcaactgt 
ccgagcgcct 
itgttgtga-fcG 
GgtttGaggG 
gggttcaaac 
gattaacctg 
pgaaggagta 
■ caaca'bgaa^ 
caatctttta 
ccagcggcaa 
ggcGaalatcc 
catcaagtat 
gttgtaaaag 
tacgcfacaat 
ccgaaggatg 
:ttctgggaga 
.CGctgcatta 
ccgGttcctG 
btcactGaaa 
tgtgagcaaa 
tcgataggct 
: gaaacccgac 
ctcctgttcc 
tggGgctttc 
agctgggctg 
atcgtcttga 
aicaggattag 
actacggGta 
tcggaaiaaag 
tttttgtti:g 
tctttticitac 
tgagatta1:c 
caatGtaaag 
cacctatctc 
agataactac 
acccacgctc 
iigcagaagtgg 
ctagagtaag 
tcgtggtgitG 
jggGgagttac 
tcgttgtbag 
attbtcttab 
lagtpattdtg 
ataatagtgt 
ggcgaaaact 



ictgatcgaaa; 
aagqtggtgg 

acGacctat^ 
gttccaaagg 
gGGgatggcg: 
•gagctgtatgj 
acgaatagct 
gatgtggatt. 
gattt'tttaa- 
.gaGagcaaca' 
ggcagggGgg 
ggggaagaac 
aaaataaaat^ 
■aaagctttaa 
••cGatactbtg 
iacgaggaa-tt 
gttGgtaaag 
tGGtGaaaGG 
IrgtactGatta 
vttggatgagt 
gatgaacgtc 
"gagaacftcca 
aagaatggag 
tGgggGctta 
GtttgactGc 
tcagGaattt 
Gaactgtatt 
vattttgacab 
•cgttagtcga 
^gccgggatca 
atgaatGggc 
gctcaGtgaG 
ggGggtaata 
aggccagGaa 
/ccgcccccct 
aggactataa 
gaccGtgccg 
tcatagctGa 
tgtgcacgaa 

'gtGGaacGGgr 
: GagagGgagg 
, caGtagaagg 
;agttggtagc 
.'caagcagcag 
• ggggtctgac 
aaaaag^atG 
ftatatatgag 
agGgatctgt 
gataGgggag 
jaccggctcca 
Itcctgcaact 
itagttcgGGa 
lacgctcgtcg 
jatgatcGGCc 
aagi:aagttg 
.tgtcatgcca 
■agaataccgc 
;atg^catagc 
ctcaaggatic 



1140 
1200 
1260 
1320 
1380 
1449 
1500 
= 1560 
1620 
1680 
1740 
1800 
1860 
i920 
1980 
2040 
2100 
2160 
2220 
2280 
2340. 
2400 
24 60 
2520 
2580 
2640 
27.00 
27 60 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3486 
35 40 
3600 
3660 
3720 

37 8 0; 

38 40 
3900 
3960 
40^0 
4080 
4140 
4200 
4260 
43120 
4380 
4440 
4500 
.45 (Sd 
4 62b 
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ttcgatgtaa .cccactcgtg cacccaactg atcttcagca 4680 

ttctgggtga gcaaaaacag gaaggca^aa . tgccgcaaaa 4740 

gaaa:tgttga atactcatac tcttcctttt tcaatattat 4800 

ttgtctcatg . agcggataca tatttgaatg itatttagaaa 4860 

gcgcacattt : cccqgaaaag tgccacctgt atgcggtgtg 4920 

aggagaaaat accgcatcag gcgaaattgt Jaaacgttaat 4980 

aaatatttgt taaatcagct cattttttaa ccaataggcc .5040 

taaa;tcaaaa igaatagaccg agatagggtt gagtgttgtt 5100 

actattaaag aacgtggact ccaacgtcaa agggcgaaaa 5160 

cccactacgt gaaccatca:c ccaaatcaag Jttttttgcgg 5220 

aaatcggaac i cctaaaggga gcccccgatt tagagcttga 5280 

ggcgagaaag gaagggaaga aagcgaa^gg agcgggcgct 5340 

ggtcacgctg cgcgtaacca ccacacccgc -cgcgcttaa 5399 

<210> 29 
<211> 6805 : 
<212> DNA 

<213> Artificial Sequence ; ; 
<220> : •' ■ 

<223> Description of Artificial SequLencerplasmid 
<400> 29 

ttgcggccgc ttcgaaagct gtaatataaa aaccttcttc aactaacggg gcaggttagt 60 
gacattagaa aaccgactgt aaaaagtaca gtcggcatta tctcatatta taaaagccag 120 
tcattaggcc tatctgacaa ttcctgaata ; gagttcataa acaatcctgc atgataacca 180 
tcacaaacag aatgatgtac ctgtaaagat, agcggtaaat ktattgaatt acctttatta 240 
atgaattttc ctgctgtaat aatgggtaga aggtaattaci tattattatt -gatatttaag 300 
ttaaacccag taaatgaagt ccatggaat^ , atagaaagag iaaaaagcatt ttcaggtata 360 
ggtgttttgg gaaacaattt ccccgaacca ; ttatatttct : ctacatcaga aaggtataaa 420 
tcataaaact ctttgaagtcattctttaca ggagtccaaa feaccagagaa tgttttagat 480 
acaccatcaa aaattgtata aagtggctct : aacttatccc iaataacctaa ctctccgtcg 540 
ctattgtaac cagttctaaa ag'ctgtattt ,gagtttatca cccttgtcac itaagaaaata 600 
aatgcagggt aaaatttata tccttcttgt tttatgtttc ggtataaaac actaatatca 660 
atttctgtgg ttatactaaa agtcgtttgt tggttcaaat aatgattaaa ±atctctttt 720 
ctcttccaat tgtctaaatc aattttatta aagttcattt gatatgcctc ctaaattttt 780 
atctaaagtg aatttaggag gcttacttgt ctgctttctt cattagaatc aatccttttt 840 
taaaagtcaa tattactgta acataaatat : atattttaaa aatatcccac tttatccaat 900 
tttcgtttgt tgaactaatg ggtgctttag ittgaagaata iaagaccacat taaaaaatgt 960 
ggtcttttgt gtttttttaa aggatttgag ' cgtagcgaaa aatecttttc tttcttatct 1020 
tgataataag ggtaactatt gaattcggta = ccaagagttt gtagaaacgc aaaaaggcca 1080 
tccgtcagga tggccttct^ cttaatttga tgcctggcag tttatggcgg gcgtcctgcc 1140 
cgccaccctc cgggccgttg cttcgcaacg : ttcaaatecg ctcccggcgg atttgtccta 1200 
ctcaggagag bgttcaccga caaacaacag. ataaaacgaa jaggcccagtc tttcgactga 1260 
gcctttcgtt ttatttgatg cctggcagtt ccctactctc gcatggggag accccacact 1320 
accatcggcg ctacggcgtt • tcacttctga gttcggcatg gggtcaggtg ggaccaccgc 1380 
gctactgccg ccaggcaaat tctgttttat : cagacdgctt ctgcgttctg atttaatctg 1440 
tatcaggctg aaaatcttct ctcatccgcc aaaacaggat bcaattatgg -cagatcaatg 1500 
agcttcacag acacaatatc agggacattt igttagttctt tcacaatttt atcttccaga 1560 
tgtctgtcaa aggaaaigcat , cajtgatggct : tctccgcctt tttccttacg ^gccaacctgc 1620 
atagttgcaa tgttaiafcatc- attatctqcg :agaatacgtc btactcggcc gatgacacct 1680 
gttgtatctt gatgctggat atacaccaag tgbcca aatattaaat 1740 

cqattgatct cgaca-attcg ttctccgaak t^aiggaatat acgtagccgt tacagtaaag 1800 
gtgctgcggt ctcctgtcac ttttacgctg atgca^ttiat cgtatccaga ttcagaagag 1860 
gaaatttttt cactgiaagct aatgccgpgt tbttttgcga baccqccggc :attgacGtca 1920 
ttaacagtag agtctacgcg: cggttttkaajaagcctgaca igaagggcttt tgtaatgaac 1980 
gaitgtttcaa gttta.gcaat tgtgccttca Ltattg^ caacatcctg .tactggttct 2040 

ttcatgcact gt^atacaag: gctgc^ 2100 
:ttagcaaatt ^latctitfetgt ; csttggcag^fc ag^^ ctg[acatgac aggcatrgcct 2l50 



ttaccgctgt tgagafcccag 
tcttttactt tcaccagcgt 
aasrggaataa gggcgacacg ; 
tgaagcattt atcagg^tta 
aataaacaaa taggggttcc 
aaataccgca cagatgcgta 
attttgttaa aattcgbgtt 
gaaatcggca aaatccctta 
ccagtttgga acaagagtcc 
accgtctatc agggcgatgg 
tcgaggtgcc gtaaagctct 
cggggaaagc cggcgaacgt 
agggcgctgg caagtgtagc 



: wo:2004/o6^27 V : : • 5i^cnyusioo^2i336 

■ tttgc^aiact gcagaacttc ttctgacact tgggcgtfcga cattgagctig tgcttctttc 2220 

• gttgatgctc ccaagtgagg agtggcaatg actaatigrgat gatcaacaag tttgttgtca 2280 
. actggcggtt cgaqttcgaa aacgtcaslgG gctgctcccg caacatgccc gttttccaaa 2340 

gcttcgagaa gtgptgcttc atcgataatt ccgcctcgcg cacagttaat taagcgaacg 2400 
cctttttitcg tttt^tgcaat cgtttcttita ttcaataagc c^tttgtttc ittttgttaaa 24 60 
ggcgtgtgaa cggt^atgat atccgcadtt ; tcaagcactt cttcaaatgt acggctgttt 2520 
acgccgattt ttttcgctct ttcttccgtt aagaaaggat caaaaacgtg cacagtcata 2580 
ccgaacgctc ctcgacgctg tgcaatttca cttccgattc ggcctaatcc .tacaatacQa 2640 
; agcgtttttc cat^aagctc tgaapcgaca ta'agctgtgc ggttccactc tctggatttc 2700 
: actgagatat. tagcctgcgg aatgtgtctc attaaagaag agatcattg.c. aaatgtatgc 2^60 
: tcagctCftcg aaatggtgtt gccgttcgga gcattgatca cgattacc6c gtgtttcgta 2820 
: gcctcatcaa; tatcgatatt atcgacaccg acaccggctc ttccgacaat ttttaaagaa 2880 
gtcattttgt tgaaaaggtc ttctgttact tttigtcgcgc ttcgcaccaa aagagcatca 294 0 
aaagtatgta : attcatcttc tgcatct^ct acgttttttt ^aacgatttc aataaagtct. 3000 
gattcaajtaa gtggctgtaa accgtcgttg ctcattttgt ctgagacc^a tactcgaaac 3060 ; 
•atgttttctc ctcctctaga gcgtcctgct gttgttaaga ttattatacc acaccttgta 3X20 
; gataaagtca acaacttttt gcaaaatttt tcaggaattt td:gcagaggt tgttctggat 3J.80 
: gtagaa'caaa acatctttcc gctcttgtgc tgttaggata tctttcttgg aagctaggta 3240 
. • ggcctcgagt tatggcagtt ggttaaaagg aaacaaaaag accgttttca cacaaaacgg .3300 
i tctttttcga: tttcttttta cagtcacagc cacttttgca aaaaccggac - agcttcatgc 3360 
^ cttataactg ctgtrttcggt cgacctgcag gcatgcaagc ttcgcgaagc ggccgccgac 3420 
; gcgaggctgg atggccttcc ccattatgat £cttctqgct tccggcggca tcgggatgcc 3^80 
: egcgttgc^g: gccatgctgt ccaggcaggt agatgadgac catcagggac agcttcaagg 3540 
atcgctcgcg gctcttacca gcctaacttc gatcactgga ccgctgatcg tcacggcgat 3600 
ttatgocgcc tcggcgagca catggaacgg gttggcatgg atitgtaggcg ccgccctata i3660 

• ccttgtctgci ctccccgcgt tgegtcgcgg tgcatggagc cgggccacct cgacctgaat 3720 
ggaagccggc ggcacctcgc taacggattc accactccaa gaattggagc caatcaattc 3780 - • 
ttgcggagaa ctgtgaatgc gcaaaccaac ccttggcaga aicatatccat cgcgtccgcc 3840 - 
afcctccagca gccgcacgeg gcgcatc^cg ggcagcgrttg ggtcQtggec acgggtgcgc 3900 
atgatcgtgc tcctgtcgtt gaggacccgg ctaggctggc gigggttgcct t act ggt tag 3960 

. cagaatgaat caccgatacg cgagcgaacg tgaagcgact gctgctgcaa 'aacgtctgcg 4020 
; ^cctgpgcaa caacatgaat ggtcttcggt ttccgtgttt cgtaaagtct ggaaacgcgg 4080 
: aagtcagcgc cctgcaccat . tatgttccgg atctgcatcg caggatgctg ctggctaccc 4140 
! tgtggaacae ctacatctgt attaacgaag cgctggcatt gaccctgagt gatttttctc 4200 

• tggtcccgcc gcatccatac cgccagttgt ttaccctcac aacgttccag taaccgggca 4260 
tgttca:tcat cagtaacccg tatcgtgagc atcctctctc g.t^^ 4320 

: dccatga^ca gaaattcccc cttacacgga ggcatcaagt gaccaaacag gaaaaaaccg 4380 
J. cccttaac^t ggcdcgcttt atcagaagcc agacattaac gbttctggag aaactcaacg 4440" 
: agctggaipgc ggatgaacag gcagacatct gtgaatdgct tcacgaccac get gatgagc 4500 
.tttaccgcag ctgcctcgcg cgtttcggtg atgacggtga aaacctctga. cacatgcagc 4560 
: tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg 4 620 
: gcgcgtcagq gggtgttggc gggtgtcggg gcgcagccat gacccagtca cgtagcgata 4680 
. gcgga^tgta tactggctta actatgcggc atcagagcag attgtactgagagtgcaqca 4740 
: tatgcggtgt; gaa^taccgc ace^gatgcgt aaggagaaaa teiccgcatGa ggcgcticttc 4800 : 
; cgcttcctcg ctcactgact cgctgcgctc . ggtcgH egg ctgcggcgag cggtatcagc 4860 
•i tcactbaaag gcggtaatac ggttatccac ii^raatc^ggg gataacgcag gaaagaacat 4 920 
: gtgaigbaaaa ggccagcaaa aggccaggaa <bcgtaakaag gccgcgttgc tggcgttttt 4980 
; CQatagg:ctc cgcGcccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 5040 
i aakcccgaca ggactataaa gatkceaggc gtttccccct ggaagctccc ;tcgt^gc|gctc 51^ 
: tcctgttpcg accc.tgccgc ttaccggata cctgtccgcc tttctccctt cgggaajgcgt 5160 
. . 1 ggcgctttct catagctcac gctgtaggi:a tetcagttcg gtgtaggtdg ttcgctcb 5220 
i gctgggptgt gtgc^icgaac cccccgttca gcccgaqcgc tgdgccttat ccggta'acta 5280 
: tcgtcttgag tccaacccgg taagacacga cttatpgcca ctggcagcag ccact ggt aa 5340! 

caggattagc agac(cgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 5400 

ctaeggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttabctt 5460 
. : cggaaaaaga gttggtagct cttgatccg^ caaaciaaacc apcgctggta gcggtggttt 5520 

ttttgtttgc aagcagcaga ttacgcgca^ aaaaaaagga tbtcaagaag atcctttgat 5^80 
: cttttctacg gggtctgacg ctcagtggaei cgaaaaptca cgttaa'ggga . ttttggtcat 5640 

gagatjiatca aaaaggatct tcacGtagati cctttjtaaat taaaaatgaa gttttaaatc 5700 
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. aatcta^agt 
acctalictca 
.gataactacg 
CGcacgctica 
c4gaagtggt 
tagagtaagt 
clftggtgtca 
gcgagttaca 
cgttgtcaga 
ttctcttact 
gtcattctga 
t^iataccgcg 
gc:gaaaactc. 
acccaactga 
aaggcaaaat 
cttccttttt 
atttgaatgt 
gccacc1:gac 
cacgagj^ccc 



atatatgagt 
!gcgatctgtd 
atacgggagg 
ccggctccag= 
cctgcaactt 
agttcgcckg 
cgctcgtcgt 
tgatccccca 
agtaagttgg 
gtcatgccat 
gaatagtgta. 
ccacatagca 
tcaaggatct' 
tcttcagcat 
gccgcaaaaa; 
caatattatt" 
atttagaaaa: 
gtctaagaaa 
tttcgtcttc" 



aaacttggtc 

taitttegttc: 

gcttaecatc 

atttatcagc 

tatGcgcctc. 

ttaatagttt 

ttggtatggc 

bgttgtgcaa 

cicgcagtgtt 

ccgtaagatg 

tgcggcgacc 

gaactttaaa 

taccgctgtt 

cttttacttt 

agggaataag 

gaagcattta 

ataaaeaaat 

qcattattat 

aagaa [ 



tgacagttac 

atccatagtt 

tggccccagt 

aataaaccag 

catccagtct 

gcgcaacgt.t 

t t cat t cage 

aaaagcggtt 

ajtcactcatg 

cttttctgrtg 

gagttgctct 

agtgctcatc 

gagatccagt 

caccagcgtt 

ggcgacacgg 

tcagggttat . 

aggggttccg 

catgacatta 



caatgcttaa 

gcctgactcc 

gctgcaatga 

ccagccggaa 

attaattgtt 

gttgccattg 

tccggttccc 

agctccttcg 

gttatggcag 

actggtgagt 

tgcccggcgt 

attggaaaac 

tcgatgtaac , 

tctgggtgag ' 

aaatgttgaa 

tgtctcatga 

cgcacatttc 

acctataaaa 



tbagtgaggc 
cpgtcgtgta 
taccgcgaga 
gggccgagcg 
gccgggaagc 
ctgcaggcat 
akcgatcaag 
gtcctccgat 
cactgcataa 
actcaaccaa 
caatacggga 
gttcttcggg 
ccactcgtgc 
caaaaacagg 
t^ctcatact 
gbggatacat 
cccgaaaagt 
ataggcgtat 



5760 
5820 
5880 
5940^ 
6000 

6060: 

6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6865 



<210> 30 . - , 

<211> 5983 

<21^> DNA • . 

<213> Artificial Sequence 

<22Q> 

<223> Description of vArtificial Sequence :plasmid 



.<400> 30 . 
tgcgcc^cta. 
tcggtgcggg 
ttaagttggg 
ttgtaatacg 
ggccgcggga 
caattatggc 
cacaatttta 
ttccttacgg 
tactcggccg 
ataaaaatca 
cgtagccgtt 
.gtatccagat 
accGccggca 
aagggctttt 
aacatcctgt 
^ttgatggtaa 
tgacat^aca 
attgagctgt 
atcaacaagt 
aacatgcccg 
cagtaaaata 
tagctcgada 
gtcataccac 
atotttcaca 
gggcttttcc 
-ttcGcagttt 
-taagcgigctg 
Gctgab^cac 
' tt Gogagcaa 
ttcaaagtgc 



cagggcgcgt 
cctcttcgct- 
taacgccagg 
actcactata 
tatcactagt 
agatcaatga 
tcttccagat 
ccaacctgca 
atgacacbtg 
atattaaatc 
acagtaaagg 
tcagaagagg 
ttgacctcat 
gtaatgaacg 
actggttctt 
ggcttaattt 
ggcaggcctt 
gcttctttcg 
ttgttgtcaa 
ttttccaaag 
taatatttta 
tactgttctt^ 
ttigrtccgccc 
aagatgttgc 
gtctttaaaai 
tcgcaatcca. 
tctaagctat" 
tccgcataca 
aggacgccat 
aggacctttig 



ccattcgcca 

attacgccag 

gttttccbag 

gggcgaattg 

gcggcegcct 

gcttcacaga 

gtctgt<:aaa 

tagttgcaat 

ttgtatcttg 

cattgatbtc 

tgctgcggtc 

aaattttttc: 

ta'acagtaga 

atgtttcaag 

tcatgcactg 

tagcaaattc 

ttgcgaactg 

ttgatgctcc 

ctggcggttc 

ctttttagac 

ttttctccca 

ccccgatatc 

tgccgcttct 

tgtctcccag 

aatcatacag 

catcggccag: 

tcgtataggg 

gctcgataat 

cggccicact . 

gaacaggcag 



..ttcaggctgc 
ctggcgaaag 
tcacgacgtt 
ggcccgacgt 
gcaggtcgac 
cacaatatca 
ggaaagcatc 
gttaatatca 
atgctggata 
giacaatitcgt 
tcctgtcact 
actgaagcta 
gtctacgcgc 
tttagcaatt 
tgatiacaagg 
atcttttgtc 
cagaacttct 
caagtgagga 
gacttcgaaa 
atctaaatct 
atcaggcttg 
ctccctgatc 
cccaagatca 
gtcgbcgtgg 
ctcgcgcgga 
atcgttattc 
acaatccgat 
cttttcaggg 
catgkigcaga 
cttticcttcc 



gcaaptgttg 

ggggatgtgc 

gtaaaacgac 

cgcatgctcc 

catatgggag 

igggacatttg 

atgatggctt 

:ttatctccga 

•tacaccaagt 

tctccgaaat 

tttacgctga 

atgccgcgtt 

iggttttaaaa 

igtgccttcat 

ctgccaattt 

atggcaggca 

tctgacactt 

gtggcaatga 

acgtcaagcg 

aggtactaaa 

;atccccag;ta 

^gaccggacgc 

ataaagccac 

gaaaagacaa 

tctttaaatg 

agtaagtaat 

atgtcgatgg 

ctttgttcat i 

•ttgctecagc ' 

agccatagca 



ggaagggcga 
tgcaaggcga 
ggccagtgaa 
cggccgccat 
agcccggatc 
ttagttcttt 
ctccgccttt 
g^atacgtcc 
gaccagtcgg 
gaggaatata 
.tgcagttatc 
cttttgcgac 
agcctgacag 
attgaatggc 
'ttcctgcaat 
ggttgatagc 
gggcggcgac 
ctaatggatg 
Ctgctcccgc 
adaattcatc 
agtcaaaaaa 
agaaggcaat 
ttactttgcc 
gttcctcttc 
gagtgtcttc^ 
ccaattcggc 
agtgaaagag 
cttcatactc 
catcatgccg 
tcatgtcctt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 



. ttcfjcgttcc acatcatagg; tiggtcccttt ai:accggctg tccgtcattt ttaaatsitag li860 
gtttjtc^ttt tctcccacca: gcttatataG cttagcagga gacattcctt ccjgtatcttt 1920 
tacg(cagcgg. tatttttcga; tcagtttttt caattccggt gatattctca ttttageicat 1980 
* . ttatt^tttc cttcctcttt- tctacagtat ttaaagatac cccM taattataac 2040 

. aag^cgaact ccaattcact: gttccttgca ttctaaaacc ttaaatacca gaaaacagct 2100 
tttl^caaagt tgttttgaaa; gttggcgtat aacatiagtat cga;cggagcc gattttgaaa 2160 
ccacaattat gatagaatt-t' acaagctata aggtt^ttgt cct'gggtttc aagcatta'gt .2220 
Gcatigcaagt ttttatgctt tgcccattct atagatatat tgataagcgc gctgcctatg 2280 
ccttjgccccc tgaaatcctt. acatacggcg ktatcttcta tataaaagat atattatctt 2340 
atcaSgtattg tcaatatatt caaggcaatc tgcctbctca tcctcttcat cctcttdgtc 2400. 
ttggtagctt tttaaatafcg gcgcttcata gagta'attct gtaaaggtcc aattctcgtt 24 60 
ttc^tacctc ggtataatct tacctatcac ctcaaiatggt tcgbtgggtt tatcgcdtga 2520. 
tgcggtattt tctccttacg catctgtgcg gtattteacg tcgacgcggc cgcdatgrgcc 2580 
. gcgggatccc ggtaccgaaa catcgttaga tttccjtccta aattgacaaa ctaaatatct 2640 
.: gatajatttaa catattctca aaagagtgtc aacgtgtatt gac'gcagtaa aggataaaag 2700 
. taa^gcctaa. taaatcaatg atctgacagc ttgcaggtaa tatatttaat ttgaagdaat 27 60 
• • tctotataca gccaaccagt tatcgtttat aatgtiaatta aatttcatat gatcaatctt 2820 
cggggcaggg tgaaattccc taccggcggt gatgagccaa tggctctaag cccgcgagct 2880 
gtctttacag caggattpgg tgagattccg gagccgacag tacagtctgg atgggagaag 2940 
atiggaggttc ataagcgttt tgaaattgaa tttttcaaac gtttctttgc ctagcctaat 3p00 
: tttcgaaacc ccgcttttat atatgaagcg gtttt.tttat tggctggaaa agaacctttc 3060 
cgttttdgag taagatgtga tcgaaaagga gagaat:gaag tgaaagtaaa aaaattagtt 3120 
gtggtcagca tgctgagcag cattgcattt gttttgatgc tgttaaattt cccgttliccg 3180 
ggtcttpcgg attatttaaa aatcgatttt agcgacgttc ccgcaattat tgccattctg 3240 
atttiiacggac ctttggcggg atcactagag ggctcccaac gcgttggatg catagcttga 3300 - 
gtattctata gtgtcaccta aatagcttgg cgtaatcatg gtcatagctg tttcctgtgt 3360 
gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag 3420* 
cct^gggtgc ctaatgagtg agctaactca cattaattgc gttigcgctca ctgcccgctt .3480 
tcc^gt.cggg aaacctgtcg tgccagctgc attaatgaat eggccaacgc gcggggagag 3540 
gcggtttgcg- tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 3600 
ttcggctgcgi gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 3660 . ^: 
caggggataa; cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaacqgta 3720 
aaaaggpcgc: gttgctggcg tttttcgata ggctccgccc ccctgacgag catcacaaaa 3780 
atcgacgctc= aagtcagagg tggcgaaacc cgacaggact ataaagatac .caggcgtttc 3840 
cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 3:900 
. ccgcctfctct cccttcggga agcgtggcgc tttctqatag ctcacgctgt aggtatctca 3960 
gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 4020 
accgctgcgc cttatccggt aactatcgtc tt^gagtccaa cccggtaaga cacgacttat 4080 
cgccactggc: agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 4140 
I cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta tttggtatct 4200 
gcgctctgct gaagccagtt accttcggaa aslagagttgg tagctcttga tccggcaaac 4260 
aaaccaccgc tggtagcggt ggtttttttg tttgcaagca geaigrattacg cgcagaaaaa 4320 
aaggatptca agaagatcct ttgatctttt ctacgg^gtc tgacgctcag tggaacgaaa 4380. 
act9acgtta agggattittg gtcatgagat tatpaaaaag gatcttcacc tagatccttt 4440 
: taaattaaaa.: ^tgaa'gt.ttt aaatca&tct aaagtatata tgagtaaact tggtctgaca 4500 
:• • gttacca^tg cttaatcagt gaggcaccta tbtcagcgat ctgtctattt cgttcatcca 4560 
: . . . \ : tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 4620 
:' ccagtgctgc aatgataccg cgagacbcac . gctcaccggc tccagiattta. tcagcaataa 4680 
r . accaigccagc cggaagg.gcc gagcgcagaa gtggtcGtgc aactttatcc gcctccaitcc 4740 
;. . , agtbtattaa ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 4800 
: acgttgttgg cattgctada ggcatcgtgg tgtcacigctc gtcgtttggt atggcttoat 4860 
. _ . tcagctccgg ttcccaabgfa tcaaggcgag ttacatgatc ccccatgttg tgcaaaa;aag 4 920 
> cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttat;cac 4^980 
tcatggttat ggcagcafcl^g cataattctc ttactgtcat gccatccgta agatgct?ttt ' 5040 
ctgtgactgg tgagtactca accaagtcat tctgagaata ccgcgcccgg cgaccgagtt . 5100 
gctcttgccc ggcgtcaata cggglataata gtgtatgaca -tagcagaact ttaaaagtgc 5160 
; tcatcattgg aaaacgttdt tcggggcgaa aactctcaag gatcttaccg ctgttga^at . 51220 

ccagttpgat gtaacccact cgtgcaccca actgatcttc agcatbtttt actttc^^^ca i sigeo . • 
' gcgtttirtgg. gtgagcaaaia acaggaaggc aaaatgccgc as^aaaaggga ataagggicga 5340 
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cacggaaatg 
^ttattgtct 
. ttccgcgcac 
cgtaaggaga 
cgttaaatat 
cttataaatc 
gtccactatt 
atggcccact 
ctctaaatcg 
acgtggcgag 
tagcggtcac 



ttgaatactc 
catigagcgga 
atttccccga 
aaataccgca 
ttgttaaatc 
aaaagaatag 
aaagaacgtg 
acgtgaacca 
gaaccctaaa 
aaaggaaggg 
gctgcgcglia 



atactcttcc 
itacatatttg 
aaagtgccac 
tcaggcgaaa 
agctcatttt 
accgagatag 
gactccaacg 
tcacccaaat 
igggagccccc 
aagaaagcga 
accaccacac 



tttttcaata 
aatgtattta 
ctgtatgcgg 
ttgtaaacgt 
ttaaccaata 
ggttgagtgt 
tcaaagggcg 
caagtttttt 
gatttagagc 
aaggagcggg 
ccgccgcgct 



ttattgaagc 
^aaaaataaa 
tgtgaaatac 
taatattttg 
ggccgaa^tc 
tgttccagtt 
aaaaaccgtc 
gcggtcgagg 
ttgacgggga 
cgctagggcg 
taa 



atttatcagg 
ca'aatagggg 
cgcacagatig 
ttaaaattcg 
ggcaaaatcc 
tggaacaaga 
tatcagggcg 
tgccgtaaag 
aagccggcga 
ctggca&gtg 



5400 
54 60 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
5983 
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<213> Artif iciai Sequence 

<220> - 
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<400> 31 

ttgcggccgb 

gacattagaa 

tcattaggcc 

tcacaaacag 

atgaattttc 

ttaaacccag 

ggtgttttgg 

tcataaaact 

acaccatcaa 

ctattgtaac 

aatgcagggt 

atttctgtgg 

ctcttccaat 

atctaaagtg 

taaaagtcaa 

tttcgtttgt 

ggtcttttgt 

tgataataag 

tccgtcagga 

cgqcaccctc 

ctcaggagag 

gcctttcgtt 

accatcggcg 

gctactgccg 

tatcaggctg 

agcittcacag 

tgtctgtcaa 

atagttgcaa 

gttgtatctt 

ccaittgatct 

gtgctgcggt 

gaaatttttt 

ttaacagtag 

gatgtttcaa 

ttcatgcact 

ttagcaaatt 

tttgcgaact 

gttgktgctc 



ttcgaaagct 

aaccgactgt 

tatctgacaa 

aatgatgtac 

ctgctgtaat 

taaatgaagt 

gaaacaattt 

ctttgaagtc 

aaattgtata 

cagttctaaa 

aaaatttata 

ttatactaaa 

tgtctaaatc 

aatttaggag 

tattactgta 

tgaactaatg 

gtttttttaa 

ggtaactatt 

tggccttctg 

cgggccgttg 

cgttcaccga 

ttatttgatg 

ctacggcgtt 

ccaggcaaat 

aaaatcttct 

acacaatatc 

aggaaagcat 

tgttaatatc 

gatgctggat 

cgacaattcg 

ctcctgtcac 

cactgaagct 

agtctacgcg 

gtttagcaat 

gtgatacaag 

catcttttgt 

gcagaacttc 

ccaagtgagg 



gtaatataaa 

aaaaagtaca 

ttcctgaata 

ctgtaaagat 

aatgggtaga 

ccatggaata 

ccccgaacca 

attctttaca 

aagtggctct 

agctgtattt 

tccttcttgt 

agtcgtttgt 

aattttatta 

gcttacttgt 

acataaatat 

ggtgctttag^ 

aggatttgag 

.gaattcggta 

cttaatttga 

cttcgcaacg 

caaacaacag 

ectggcagtt. 

•tcacttctga 

tctgt]tttat 

ctcatccgcc 

agggacattt 

catgatggct 

:attatctccg 

atacaccaag 

ittctccgaaa 

ttttacgctg; 

aatgccgcgt 

cggttttaaa 

tgtgccttca 

gctgccaatt 

catggcaggc 

ttctgacact: 

agtggcaatg: 



aaccttcttc 
gtcggcatta 
gagttcataa 
agcggtaaat 
aggtaattac 
atagaaagag 
ttatatttct 
ggagtccaaa 
aacttatccc 
gagtttatca 
tttatgtttc 
tggttcaaat 
aagttcattt 
ctgctttctt 
atattttaaa. 
ttgaagaata 
cgtagcgaaa 
ccaagagttt 
tgcctggcag 
ttcaaatccg 
ataaaacgaa 
ccctactctc • 
gttcggcatg 
cagaccgctt 
aaaacaggat 
gttagttctt 
tctccgcctt 
agaatacgtc 
tgaccagtcg 
tgaggaatat 
atgcagttat 
tcttttgcga ; 
aagcctgaca 
tattgaatgg 
tttcctgcaa 
aggttgatag 
tgggcggcga 
actaatggat 



aactaacggg 

tctcatatta 

acaatcctgc 

atattgaatt 

tattattatt 

aaaaagcatt 

ctacatcaga 

t:accagagaa: 

aataacctaa 

cccttgtcac 

ggtataaaac 

aatgattaaa 

gatatgcctc 

cattagaatc 

aatatccdac 

aagaccacat 

aatccttttc 

gtagaaacgc 

tttatggcgg 

ctcccggdgg 

aggcccagtc 

gcatggggag 

gggtcaggtg. 

ctgcgttctg 

ecaattatgg 

tcacaatttt 

tttccttacg 

ctactcggcc 

gataaaaatc; 

acgtagcegt' 

cgtatccaga- 

cacccccggc 

gaagggcttt 

caacatcctg 

tttgatggta 

ctgacatgac- 

cattgagctg 

gatcaacaag 



gcaggttagt 60 
taaaagccag 120 
atgataacca 180 
acctttatta 240 
gatatttaag 300 
ttcaggtata 360 
aaggtataaa 420 
tgttttagat 480 
ctctccgtcg 540 
taagaaaata 600 
actaatatca 660 
tatctctttt 720 
ctaaattttt 780 
aatcettttt 840 
tttatccaat 900 . 
taaaaaatgt 960 
tttcttatct 1020 
aaaaaggcca 1080 
gcgtcctgcc 1140 
atttgtccta 1200 
tttcgactga 1260 
accccacact 1320 
ggaccabcgc 1380 
atttaatctg 14 40 
cagatcaatg 1500 
atcttccaga 1560 
gccaacctgc 1620 
gatgacacct 1680 
aatattaaat 1740 
tacagtaaag 1800 
ttcagaagag I860 
attgacctca 1920 
tgtaatgaac 1986 
tactggttct 2040 
aggctt^att 2100 
aggcaggcct 2160 
tgcttctttc 2220 
tttgttgtca 2280 
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. abtggcggtt cgapttcgaa aacgl^caagc gctgctcccg caacatgccc gttttccaaa 2340 
gcttcgagaa gtgctgcttc atcgataatt ccgcctcgcg cacagttaat taagcgaacg 2400 
ccttttttcg tttttgcaat cgtttcttta ttcaataagc cttttgtttc ttttgttaaa 24 60 
ggcgtgtgaa cggtaatgat atccgcactt tcaagcactt cttcaaatgt acggctgttt 2520 
acgccgattt ttttcgctct ttcttccgtt aagaaaggat caaa.aacgtg cacagtcata 2580 
ccgaacgctG ctcgacgctg tgcaatttca cttccgattc ggcctaatcc tacaatacca 2640 

. agcgtttttc cataaagctc tgaaccgapa taagctgtgc ggttccactc tctggatttq 2700 

. abtgagatat . tagcctgcgg aatgtgtctc attaaagaag agatcattgc aaatgtatgc 2760 
tcagctgtcg aaatggtgtt gccgttcgga gcattgatca cgattacccc gtgtttcgta 2820 
gcctcatcaa tatcgatatt atcgacaccg acaccggctc ttccgacaat ttttaaagaa 2880 
gtcattttgt tgaaaaggtc ttctgttact tttgtcgcgc ttcgcaccaa aagagcatca 2940 
aaagtatgta attcatcttc tgcatctgct acgttttttt gaacgatttc aataaagtct 3000 
gattcaataa gtggctgtaa accgtcgttg ctcattttgt ctgagaccaa tactcgaaac 3060 
atgttttctc ctcctctaga gcgtcctgct gttgttaaga ttattatacc acaccttgta 3120 
gataaagtca acaacttttt gcaaaatttt tcaggaattt tagcagaggt tgttctggat 3180 
gtagaacaaa acatctttcc gctcttgtgc tgttaggata tctttcttgg aagctaggta 324 0 
ggcctcgagt tatggcagtt ggttaaaagg aaacaaaaag accgttttca cacaaaacgg 3300 

. tctttttcga tttcttttta cagtcacagc eacttttgca aaaaiccggac agcttcatgc 3360 
cttataactg ctgtttcggt cgacgaaaca tcgttagatt tcetcctaaa ttgacaaact 3420 
aaatatctga taatttaaca tattctcaaa agagtgtcaa cgtgtattga cgcagtaaag 3480 
gataaaagta aagcctaata aatcaatgat ctgacagctt gcaggtaata tatttaattt 3540 
gaagcaattc tctatacagc caaccagtta tcgtttataa tgtaattaaa tttcatatga 3600 
tcaatcttcg gggcagggtg aaattcccta ccggcggtga tgagccaatg gctctaagcc 3660 
cgcgagctgt ctttacagca ggattcggtg agattccgga gccgacagta cagtctggat 3720 
gggagaagat ggaggttcat aagcgttttg aaattgaatt tttcaaacgt ttctttgcct 3780 
agcctaattt tcgaaacccc gcttttatat atgaagcggt ttttttattg gctggaaaag 3840 
aacctttccg ttttcgagta agatgtgatc gaaaaggaga gaatgaagtg. aaagtaaaaa 3900 
apttagttgt ggtcagcatg caagcttcgc gaagcggccg ccgacgcgag gctggatggc 3960 . 

:.cttccccatt atgattettc tegcttccgg cggcatcggg atgcccgcgt tgcaggccat 4020 . 
gctgtccaggcaggtagatg acgaccatca gggacagett caaggatcgc tcgcggctct 4080 
taccagccta acttcgatca ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc 414 0 
gagcacatgg aacgggttgg catggattgt aggcgccgcc ctataccttg tctgcctccc 4200 
cgcgttgcgt cgcggtgcat ggagccggigc cacctcgacc tgaatggaag ccggcggcac 4260 
ctcgctaacg gattcaccac tccaagaatt ggagccaatc aattcttgcg. gagaactgtg 4320. 
aatgcgcaaa ccaacccttg gcagaacata tccatcgcgt ccgccatctc cagcagccgc 4380 
acgcggcgca tctcgggcag cgttgggtbc tggccacggg tgcgcatgat cgtgctcctg 44 40 
tcgttgagga cccggctagg ctggcggggt tgccttactg gttagcagaa tgaatcaccg 4500 
atacgcgagc gaacgtgaag cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca 45 60 
tgaatggtct tcggtttccg tgtttcgtaa. agtctggaaa cgcggaagtc agcgccctgc 4620 
accattatgt tccggatctg catcgcag/ga tgctgctggc taccctgtgg aacacctaca 4680 
tctgtattaa cgaagcgctg gcattgaccc tgagtgattt ttctctggtc ccgccgcatc 47 40 
cataccgcca gttgtttacc ctcacaacgt tccagtaacc gggcatgttc atcatcagta 4800 
acccgtatcg tgagcatcct ctctcgtttc atcggtatca ttacccccat gaacagaaat 4860 
tcccccttac acggaggcat caagtgacca aacaggaaaa aaccgccctt aacatggccc 4920 
gctttatca^ aagccagaca ttaacgcttc tggagaaact caacgagctg gacgcggatg 4980 
aacaggcaga catctgtgaa tcgcttcacg accacgctga tgagctttac cgcagctgcc 5040 
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 5100 
cjagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagegggtg 5160 
ttggegggtg tcggggcgca gccatgaccc agtcacgtag cgatagcgga gtgtatactg 5220 
gjcttaactat gcggcatcag agcagattgt actgagagtg caccatatgc ggtgtgaaat 5280 
aiccgcabaga tgcgtaagga gaaaataccg catcaggcgc tcttccgctt Cctcgctcac 5340 
tlgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt 5400 
aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca 54 60 
gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc 5520 
ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact 5580 
ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct 5640 
gccgcttacc ggatacctgt ccgcctttet cccttcggga agcgtggdgc tttctcatag 5700 
ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca 57 60 

cgaajDccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtcc'aa 5820 
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,\ ccjcggtaaga (bacgacttat cgccactggc agcagccact ggtaadaggi ttagcagagc 5880 
; . gaggtatgrta ggcggtgcta cagagttctt ; gaagtggtgg cctaactacg gctacactag 5940 
^ aaggacagta tttggtatct gcigctctgct gaagccagtt accttcggaa aaagagttgg 6000 
: tagct.cttga tccggcaaac aakccaccgc ' tggtagcggt g^tttttttg tttgcakgca 6060 

gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc 6120 
. : tgacgptcag tggaacgaaa actcacgtta ; agggattttg gtcatgagat tatcaaaaag 6180 
gatcttcacc tagatccttt taaattaaaa ; atgaagtttt aaatcaatct aaagtata=ta .6240 
. tgagtaaact tggtctgaca gttacchatg cttaatcagt gaggcaccta tctcagcgat 6300 
ct^ctattt Cgttcatcca tagttgbctg actccccgtc gtgtagataa ctacgatacg 6360 ^ 
ggagggctta ccatctggcc ccagtgptgc ; aatgataccg cgagacccac gctcaccggc 6420 
. . tccagattta tcagcaataa accaigcbagc ; cggaagggcc gagcgcagaa gtggtcctgc 6480 
I aactttatcc gcctccatcc agtctattaa , ttgttgccgg gaagctagag taagtagttc 6540 : 

} gccagttaat agtttgcgca acgttgttgc cattgctgca ggcatcgtgg tgticacgctc 6600 

gtcgtttggt atggcttcat tcagbtccg^ ttccca'acga tcaaggcgag ttacatgatc 6660 - 
ccccatgttg tgcaa^aaag. cggtta^ctc cttcggtcct ccgatcgttg tcagaagtaa 6720 
gttggccgca : gtgttatcac tcatggttat ggcagcactg cataattctc ttactgtcat 6780 
: gccatccgtajagatgctttt ctgtgactg^ tgagtactca acckagtcat tctgagaata 6840 
. gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata cgggataata ccgbgccaca 6900 

. : tagcagaact ttaaaagtgc tcatlcattgg aaaacgttct tcggggcgaa aactctcaag 6960' . 

gatcttaccg : ctgttgaigat ccagttcgal: gtaacdcact cgtgcaccca actgatcttc: 7020 
\ ^ agcatctttt actttcacca gcjgtttctgg gtgagcaaaa acaggaaggc aaaatgcfcgc 7080 
aaaaaaggga ataagggcga cacggakatg . ttgaatactc atactbttcc tttttcaata 7140 
' ■ ttattgaagc^tttatcagg gttattgtct catgag[cgga tacatatttg aatgtattta 7200 
gaaaaataaa; caaatagggg ttccgcgcac ; atttccccga aaagtgccac ctgacgtcta 7260 
agaaaccatt attatcatga cattaaccta taaaaatagg cgtatcacga ggccctttcg 7320 
; tcttcaagaa, % ■ . 7330 • 



